PRODUCTION OF VERY LONG CHAIN POLYUNSATURATED FATTY ACIDS IN
OILSEED PLANTS

This application claims the benefit of U.S. Provisional Application
No. 60/446,941, filed February 12, 2003, the disclosure of which is hereby
incorporated by reference in its entirety.

FIELD OF THE INVENTION 
This invention pertains to oilseed plants which have been transformed to
produce very long chain polyunsaturated fatty acids and to recombinant constructs
and methods for producing such fatty acids in a plant.

BACKGROUND OF THE INVENTION 
Lipids/fatty acids are water-insoluble organic biomolecules that can be 
extracted from cells and tissues by nonpolar solvents such as chloroform, ether or 
benzene. Lipids have several important biological functions, serving (1) as 
structural components of membranes, (2) as storage and transport forms of
metabolic fuel, (3) as a protective coating on the surface of many organisms, and
(4) as cell-surface components concerned in cell recognition, species specificity
and tissue immunity.

The human body is capable of producing most of the fatty acids which it 
requires to function. Two long chain polyunsaturated fatty acids, eicosapentaenoic
acid (EPA) and docosahexaenoic acid (DHA), however, cannot be synthesized
efficiently by the human body and, thus, have to be supplied through the diet. Since
the human body cannot produce adequate quantities of these polyunsaturated fatty
acids, they are called essential fatty acids.

PUFAs are important components of the plasma membrane of the cell, where
they may be found in such forms as phospholipids and also can be found in
triglycerides. PUFAs also serve as precursors to other molecules of importance in
human beings and animals, including the prostacyclins, leukotrienes and
prostaglandins. There are two main families of polyunsaturated fatty acids (PUFAs),
specifically, the omega-3 fatty acids and the omega-6 fatty acids.

DHA is a fatty acid of the omega-3 series according to the location of the last 
double bond in the methyl end. It is synthesized via alternating steps of 
desaturation and elongation. Production of DHA is important because of its
beneficial effect on human health. Currently the major sources of DHA are oils from
fish and algae.
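
The alternation of desaturation and elongation steps can be illustrated with a short Python sketch. The route shown (from alpha-linolenic acid, 18:3, to DHA, 22:6) follows the commonly described delta-6 pathway and is assumed here for illustration only; the function names, the step labels and the tuple representation are hypothetical and are not taken from this disclosure.

```python
# Illustrative sketch only. A fatty acid is modelled as (carbons, double_bonds).
# The step order is the commonly described delta-6 route and is an assumption.

def desaturate(fatty_acid):
    """Add one double bond; the chain length is unchanged."""
    carbons, double_bonds = fatty_acid
    return (carbons, double_bonds + 1)


def elongate(fatty_acid):
    """Add two carbons; the number of double bonds is unchanged."""
    carbons, double_bonds = fatty_acid
    return (carbons + 2, double_bonds)


# Hypothetical step labels paired with the operation each one performs.
STEPS = [
    ("delta-6 desaturase", desaturate),
    ("C18 to C20 elongase", elongate),
    ("delta-5 desaturase", desaturate),
    ("C20 to C22 elongase", elongate),
    ("delta-4 desaturase", desaturate),
]


def run_pathway(start=(18, 3)):
    """Return every intermediate, starting from alpha-linolenic acid (18:3)."""
    intermediates = [start]
    for _label, step in STEPS:
        intermediates.append(step(intermediates[-1]))
    return intermediates


if __name__ == "__main__":
    for carbons, double_bonds in run_pathway():
        print(f"{carbons}:{double_bonds}")
```

Under these assumptions the sequence passes through 20:5 (the carbon and double bond count of EPA) before ending at 22:6 (DHA).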

EPA and arachidonic acid (AA) are both delta-5 essential fatty acids. EPA
belongs to the omega-3 series with five double bonds in the acyl chain, is found in
marine food, and is abundant in oily fish from the North Atlantic. AA belongs to the 




WO 2004/071467



PCT/US2004/005758 



omega-6 series with four double bonds. The lack of a double bond in the omega-3 
position confers on AA different properties than those found in EPA. The 
eicosanoids produced from AA have strong inflammatory and platelet aggregating
properties, whereas those derived from EPA have anti-inflammatory and
anti-platelet aggregating properties. AA can be obtained from some foods such as meat,
fish, and eggs, but the concentration is low.

Gamma-linolenic acid (GLA) is another essential fatty acid found in 
mammals. GLA is the metabolic intermediate for very long chain omega-6 fatty 
acids and for various active molecules. In mammals, formation of long chain PUFAs
is rate-limited by delta-6 desaturation. Many physiological and pathological
conditions such as aging, stress, diabetes, eczema, and some infections have been
shown to depress the delta-6 desaturation step. In addition, GLA is readily
catabolized as a result of the oxidation and rapid cell division associated with certain
disorders, e.g., cancer or inflammation.

Research has shown that omega-3 fatty acids reduce the risk of heart
disease as well as having a positive effect on children's development. Results have
been disclosed indicating the positive effect of these fatty acids on certain mental 
illnesses, autoimmune diseases and joint complaints. Thus, there are many health 
benefits associated with a diet supplemented with these fatty acids. 

Unfortunately, there are several disadvantages associated with commercial
production of PUFAs from natural sources. Natural sources of PUFAs, such as
animals and plants, tend to have highly heterogeneous oil compositions. The oils
obtained from these sources can require extensive purification to separate out one
or more desired PUFAs or to produce an oil which is enriched in one or more
PUFAs. Natural sources also are subject to uncontrollable fluctuations in
availability. Fish stocks may undergo natural variation or may be depleted by
overfishing. Fish oils have unpleasant tastes and odors which may be difficult, if not
impossible, to economically separate from the desired product, and can render such
products unacceptable as food supplements. Animal oils and, in particular, fish oils,
can accumulate environmental pollutants. Weather and disease can cause
fluctuation in yields from both fish and plant sources.

The supply of polyunsaturated fatty acids from natural sources and
from chemical synthesis is not sufficient for commercial needs. Therefore, it is of
interest to find alternative means to allow production of commercial quantities of
PUFAs. Biotechnology offers an attractive route for producing long chain PUFAs
(LCPUFAs) in a safe, cost efficient manner.

WO 02/26946, published April 4, 2002, describes isolated nucleic acid
molecules encoding FAD4, FAD5, FAD5-2 and FAD6 fatty acid desaturase family
members which are expressed in LCPUFA-producing organisms, e.g.,
Thraustochytrium, Pythium irregulare, Schizochytrium and Crypthecodinium. It is
indicated that constructs containing the desaturase genes can be used in any
expression system including plants, animals, and microorganisms for the production
of cells capable of producing LCPUFAs.

WO 02/26946, published April 4, 2002, describes FAD4, FAD5, FAD5-2, and
FAD6 fatty acid desaturase members and uses thereof to produce long chain 
polyunsaturated fatty acids. 

WO 98/55625, published December 19, 1998, describes the production of
polyunsaturated fatty acids by expression of polyketide-like synthesis genes in
plants. 

WO 98/46764, published October 22, 1998, describes compositions and 
methods for preparing long chain fatty acids in plants, plant parts and plant cells 
which utilize nucleic acid sequences and constructs encoding fatty acid 
desaturases, including delta-5 desaturases, delta-6 desaturases and delta-12
desaturases. 

U.S. Patent No. 6,075,183, issued to Knutzon et al. on June 13, 2000, 
describes methods and compositions for synthesis of long chain polyunsaturated 
fatty acids in plants. 

U.S. Patent No. 6,459,018, issued to Knutzon on October 1, 2002, describes
a method for producing stearidonic acid in plant seed utilizing a construct
comprising a DNA sequence encoding a delta-six desaturase.

Spychalla et al., Proc. Natl. Acad. Sci. USA, Vol. 94, 1142-1147 (Feb. 1997),
describes the isolation and characterization of a cDNA from C. elegans that, when
expressed in Arabidopsis, encodes a fatty acid desaturase which can catalyze the
introduction of an omega-3 double bond into a range of 18- and 20-carbon fatty
acids.

SUMMARY OF THE INVENTION 
The invention includes an oilseed plant that produces mature seeds in which 
the total seed fatty acid profile comprises at least 1.0 % of at least one
polyunsaturated fatty acid having at least twenty carbon atoms and five or more
carbon-carbon double bonds.

In a second embodiment, this invention includes an oilseed plant that
produces mature seeds in which the total seed fatty acid profile comprises at least
5.0 % of at least one polyunsaturated fatty acid having at least twenty carbon atoms
and five or more carbon-carbon double bonds.

In a third embodiment, this invention includes an oilseed plant that produces
mature seeds in which the total seed fatty acid profile comprises at least 10.0 % of
at least one polyunsaturated fatty acid having at least twenty carbon atoms and five
or more carbon-carbon double bonds.

Additional embodiments of this invention include an oilseed plant that 
produces mature seeds in which the total seed fatty acid profile comprises at least 
15.0 %, 20 %, 25 %, 30 %, 40 %, 50 %, or 60 % of at least one polyunsaturated
fatty acid having at least twenty carbon atoms and five or more carbon-carbon 
double bonds. 

In a fourth embodiment, this invention includes an oilseed plant that produces
mature seeds in which the total seed fatty acid profile comprises at least 10.0 % of
at least one polyunsaturated fatty acid having at least twenty carbon atoms and five
or more carbon-carbon double bonds and less than 2.0 % arachidonic acid.

Again additional embodiments would include an oilseed plant that produces 
mature seeds in which the total seed fatty acid profile comprises at least 15.0 %, 
20 %, 25 %, 30 %, 40 %, 50 %, or 60 % of at least one polyunsaturated fatty acid 
having at least twenty carbon atoms and five or more carbon-carbon double bonds
and less than 2.0 % arachidonic acid. 
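
The compositional thresholds recited in the foregoing embodiments can be expressed as a simple check, sketched below in Python. The profile format (name mapped to carbon count, double bond count and percent of total seed fatty acids), the key "AA" and the function name are hypothetical choices made for illustration; they are not part of this disclosure.

```python
# Illustrative sketch only. A profile maps a fatty acid name to
# (carbons, double_bonds, percent_of_total_seed_fatty_acids).

def meets_embodiment(profile, min_percent=10.0, max_aa_percent=None):
    """Check a seed fatty acid profile against a threshold of the kind recited above.

    Returns True when at least one fatty acid with >= 20 carbons and >= 5
    double bonds makes up at least min_percent of the profile and, if
    max_aa_percent is given, arachidonic acid (key "AA") is below that limit.
    """
    has_lcpufa = any(
        carbons >= 20 and double_bonds >= 5 and percent >= min_percent
        for carbons, double_bonds, percent in profile.values()
    )
    if max_aa_percent is None:
        return has_lcpufa
    # A missing "AA" entry is treated as 0 % (an assumption of this sketch).
    aa_percent = profile.get("AA", (20, 4, 0.0))[2]
    return has_lcpufa and aa_percent < max_aa_percent


if __name__ == "__main__":
    # Made-up numbers, used only to exercise the function.
    example = {"EPA": (20, 5, 12.0), "AA": (20, 4, 1.5), "oleic": (18, 1, 20.0)}
    print(meets_embodiment(example, min_percent=10.0, max_aa_percent=2.0))
```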

The PUFA can be an omega-3 fatty acid selected from the group consisting 
of EPA, DPA and DHA.

Also of interest are seeds obtained from such plants and oil obtained from the 
seeds of such plants.

In a fifth embodiment, this invention includes a recombinant construct for 
altering the total fatty acid profile of mature seeds of an oilseed plant, said construct 
comprising at least two promoters wherein each promoter is operably linked to a 
nucleic acid sequence encoding a polypeptide required for making at least 
one polyunsaturated fatty acid having at least twenty carbon atoms and four or more
carbon-carbon double bonds and further wherein the total fatty acid profile
comprises at least 2 % of at least one polyunsaturated fatty acid having at least
twenty carbon atoms and four or more carbon-carbon double bonds and further
wherein said polypeptide is an enzyme selected from the group consisting of a Δ4
desaturase, a Δ5 desaturase, a Δ6 desaturase, a Δ15 desaturase, a Δ17
desaturase, a C18 to C22 elongase and a C20 to C24 elongase.

In a further aspect, the promoter is selected from the group consisting of the 
alpha prime subunit of beta conglycinin promoter, Kunitz trypsin inhibitor 3 promoter, 
annexin promoter, Gly1 promoter, beta subunit of beta conglycinin promoter,
P34/Gly Bd m 30K promoter, albumin promoter, Leg A1 promoter and Leg A2
promoter. Also of interest are oilseed plants comprising in their genome such
recombinant constructs, seeds obtained from such plants and oil obtained from the
seeds of such plants.

In a sixth embodiment, this invention includes a method for making an
oilseed plant having an altered fatty acid profile which comprises: 

a) transforming a plant with the recombinant construct of the fifth 
embodiment; 

b) growing the transformed plant of step (a); and

c) selecting those plants wherein the total fatty acid profile comprises at
least 1.0 % of at least one polyunsaturated fatty acid having at least
twenty carbon atoms and five or more carbon-carbon double bonds.

In a seventh embodiment, this invention includes a method for making an oilseed
plant having an altered fatty acid profile which comprises:

a) transforming a plant with the recombinant construct of the fifth
embodiment including any one of the promoters recited therein;

b) growing the transformed plant of step (a); and

c) selecting those plants wherein the total fatty acid profile comprises at
least 1.0 % of at least one polyunsaturated fatty acid having at least
twenty carbon atoms and five or more carbon-carbon double bonds.

Also of interest are oilseed plants made by such methods, seeds obtained
from such plants and oil obtained from the seeds of such plants.

In an eighth embodiment, this invention includes a food product, beverage,
infant formula, or nutritional supplement incorporating any of the oils of the
invention.

In a ninth embodiment, this invention includes a food product, pet food or
animal feed which has incorporated therein any of the seeds of the invention. 

In a tenth embodiment, this invention includes an oilseed plant that produces
mature seeds in which the total seed fatty acid profile comprises polyunsaturated
fatty acids having at least twenty carbon atoms and five or more carbon-carbon
double bonds wherein the ratio of EPA:DHA is in the range from 1:100 to 860:100.
The oilseed plant may further have a total seed fatty acid profile comprising less
than 2.0% arachidonic acid. Also of interest are seeds obtained from such plants
and oil obtained from the seeds of such plants.

In an eleventh embodiment, this invention includes an oilseed plant that 
produces mature seeds in which the total seed fatty acid profile comprises 
polyunsaturated fatty acids having at least twenty carbon atoms and five or more 
carbon-carbon double bonds wherein the ratio of DHA:EPA is in the range from
1:100 to 110:100. The oilseed plant may further have a total seed fatty acid profile
comprising less than 2.0% arachidonic acid. Also of interest are seeds obtained 
from such plants and oil obtained from the seeds of such plants. 
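
The two ratio ranges recited in the tenth and eleventh embodiments can be checked as sketched below. The function name, the constant names and the decision to treat both endpoints as inclusive are assumptions made for illustration; exact fractions are used so that boundary values are not disturbed by floating-point rounding.

```python
# Illustrative sketch only: tests whether a ratio of two percentages falls
# inside one of the ranges recited above. Inclusive endpoints are assumed.
from fractions import Fraction

EPA_TO_DHA_RANGE = (Fraction(1, 100), Fraction(860, 100))  # 1:100 to 860:100
DHA_TO_EPA_RANGE = (Fraction(1, 100), Fraction(110, 100))  # 1:100 to 110:100


def ratio_in_range(numerator_pct, denominator_pct, bounds):
    """Return True if numerator_pct:denominator_pct lies within bounds."""
    if denominator_pct == 0:
        return False  # the ratio is undefined when the denominator is absent
    # Convert via str() so that values such as 8.6 become exact decimals.
    ratio = Fraction(str(numerator_pct)) / Fraction(str(denominator_pct))
    low, high = bounds
    return low <= ratio <= high


if __name__ == "__main__":
    # Made-up percentages, used only to exercise the function.
    print(ratio_in_range(8.6, 1.0, EPA_TO_DHA_RANGE))
    print(ratio_in_range(2.0, 1.0, DHA_TO_EPA_RANGE))
```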




BIOLOGICAL DEPOSITS 
The following plasmids have been deposited with the American Type Culture
Collection (ATCC), 10801 University Boulevard, Manassas, VA 20110-2209, and
bear the following designations, accession numbers and dates of deposit.


Plasmid    Accession Number    Date of Deposit
pKR274     ATCC PTA-4988       Jan. 30, 2003
pKR275     ATCC PTA-4989       Jan. 30, 2003
pKR357     ATCC PTA-4990       Jan. 30, 2003
pKR365     ATCC PTA-4991       Jan. 30, 2003
pKKE2      ATCC PTA-4987       Jan. 30, 2003



BRIEF DESCRIPTION OF THE 
DRAWINGS AND SEQUENCE LISTINGS 
The invention can be more fully understood from the following detailed
description and the accompanying drawings and Sequence Listing, which form a
part of this application.

The sequence descriptions summarize the Sequence Listing attached
hereto. The Sequence Listing contains one letter codes for nucleotide sequence
characters and the single and three letter codes for amino acids as defined in the
IUPAC-IUB standards described in Nucleic Acids Research 13:3021-3030 (1985)
and in the Biochemical Journal 219 (No. 2):345-373 (1984).

Figure 1 shows possible biosynthetic pathways for PUFAs.
Figure 2 shows possible pathways for production of LC-PUFAs including EPA
and DHA compiled from a variety of organisms.
Figure 3 is a schematic depiction of plasmid pKR274.
Figure 4 is a schematic depiction of plasmid pKKE2.
Figure 5 is a schematic depiction of plasmid pKR275.
Figure 6 is a schematic depiction of plasmid pKR365.
Figure 7 is a schematic depiction of plasmid pKR364.
Figure 8 is a schematic depiction of plasmid pKR357.

SEQ. ID. NO:1 sets forth oligonucleotide primer GSP1 used to amplify the 
soybean annexin promoter. 

SEQ. ID. NO:2 sets forth oligonucleotide primer GSP2 used to amplify the 
soybean annexin promoter. 

SEQ. ID. NO:3 sets forth the sequence of the annexin promoter.

SEQ. ID. NO:4 sets forth oligonucleotide primer GSP3 used to amplify the
soybean BD30 promoter.

SEQ. ID. NO:5 sets forth oligonucleotide primer GSP4 used to amplify the
soybean BD30 promoter.

SEQ. ID. NO:6 sets forth the sequence of the soybean BD30 promoter.

SEQ. ID. NO:7 sets forth the sequence of the soybean β-conglycinin β-subunit
promoter.

SEQ. ID. NO:8 sets forth oligonucleotide primer β-con oligo Bam used to
amplify the promoter for soybean β-conglycinin β-subunit.

SEQ. ID. NO:9 sets forth oligonucleotide primer β-con oligo Not used to
amplify the promoter for soybean β-conglycinin β-subunit.

SEQ. ID. NO:10 sets forth the sequence of the soybean glycinin Gly-1
promoter.

SEQ. ID. NO:11 sets forth oligonucleotide primer glyoligo Bam used to 
amplify the Gly-1 promoter. 

SEQ. ID. NO:12 sets forth oligonucleotide primer glyoligo Not used to amplify 
the Gly-1 promoter.

SEQ. ID. NO:13 sets forth oligonucleotide primer oCGR5-1.

SEQ. ID. NO:14 sets forth oligonucleotide primer oCGR5-2.

SEQ. ID. NO:15 sets forth oligonucleotide primer oSAlb-9.

SEQ. ID. NO:16 sets forth oligonucleotide primer oSAlb-3.

SEQ. ID. NO:17 sets forth oligonucleotide primer oSAlb-4.

SEQ. ID. NO:18 sets forth oligonucleotide primer oSAlb-2.

SEQ. ID. NO:19 sets forth oligonucleotide primer LegPro5'.

SEQ. ID. NO:20 sets forth oligonucleotide primer LegPro3'.

SEQ. ID. NO:21 sets forth oligonucleotide primer LegTerm5'.

SEQ. ID. NO:22 sets forth oligonucleotide primer LegTerm3'.

SEQ. ID. NO:23 sets forth oligonucleotide primer oKTi5.

SEQ. ID. NO:24 sets forth oligonucleotide primer oKTi6.

SEQ. ID. NO:25 sets forth oligonucleotide primer LegA1Pro5'.

SEQ. ID. NO:26 sets forth oligonucleotide primer LegA1Pro3'.

SEQ. ID. NO:27 sets forth oligonucleotide primer LegA1Term5'.

SEQ. ID. NO:28 sets forth oligonucleotide primer LegA1Term3'.

SEQ. ID. NO:29 sets forth oligonucleotide primer annreamp5'.

SEQ. ID. NO:30 sets forth oligonucleotide primer annreamp3'.

SEQ. ID. NO:31 sets forth oligonucleotide primer BD30 reamp5'.

SEQ. ID. NO:32 sets forth oligonucleotide primer BD30 reamp3'.

SEQ. ID. NO:33 sets forth the sequence of the gene for Mortierella alpina
delta-6 desaturase.

SEQ. ID. NO:34 sets forth the protein sequence of the Mortierella alpina
delta-6 desaturase. 

SEQ. ID. NO:35 sets forth the sequence of the gene for Saprolegnia diclina 
delta-6 desaturase. 

SEQ. ID. NO:36 sets forth the protein sequence of the Saprolegnia diclina 
delta-6 desaturase. 

SEQ. ID. NO:37 sets forth the sequence of the gene for Saprolegnia diclina 
delta-5 desaturase. 

SEQ. ID. NO:38 sets forth the protein sequence of the Saprolegnia diclina 
delta-5 desaturase. 

SEQ. ID. NO:39 sets forth the sequence of the gene for Thraustochytrium 
aureum elongase. 

SEQ. ID. NO:40 sets forth the protein sequence of the Thraustochytrium 
aureum elongase.

SEQ. ID. NO:41 sets forth the sequence of the gene for Saprolegnia diclina 
delta-17 desaturase. 

SEQ. ID. NO:42 sets forth the protein sequence of the Saprolegnia diclina 
delta-17 desaturase. 

SEQ. ID. NO:43 sets forth the sequence of the gene for Mortierella alpina 
elongase. 

SEQ. ID. NO:44 sets forth the protein sequence of the Mortierella alpina 
elongase. 

SEQ. ID. NO:45 sets forth the sequence of the gene for Mortierella alpina 
delta-5 desaturase. 

SEQ. ID. NO:46 sets forth the protein sequence of the Mortierella alpina 
delta-5 desaturase. 

SEQ. ID. NO:47 sets forth the sequence of At FAD3, the gene for Arabidopsis 
thaliana delta-15 desaturase. 

SEQ. ID. NO:48 sets forth the protein sequence of the Arabidopsis thaliana 
delta-15 desaturase. 

SEQ. ID. NO:49 sets forth the sequence of the gene for Pavlova sp. 
elongase. 

SEQ. ID. NO:50 sets forth the protein sequence of the Pavlova sp. elongase. 

SEQ. ID. NO:51 sets forth the sequence of the gene for Schizochytrium 
aggregatum delta-4 desaturase. 

SEQ. ID. NO:52 sets forth the protein sequence of the Schizochytrium 
aggregatum delta-4 desaturase. 

SEQ. ID. NO:53 sets forth oligonucleotide primer RSP19F.

SEQ. ID. NO:54 sets forth oligonucleotide primer RSP19R.

SEQ. ID. NO:55 sets forth oligonucleotide primer RBP2F.

SEQ. ID. NO:56 sets forth oligonucleotide primer RBP2R.

SEQ. ID. NO:57 sets forth oligonucleotide primer CGR4F.

SEQ. ID. NO:58 sets forth oligonucleotide primer CGR4R.

SEQ. ID. NO:59 sets forth oligonucleotide primer oSGly-1.

SEQ. ID. NO:60 sets forth oligonucleotide primer oSGly-2.

SEQ. ID. NO:61 sets forth consensus desaturase Protein Motif 1.

SEQ. ID. NO:62 sets forth oligonucleotide primer RO1144.

SEQ. ID. NO:63 sets forth consensus desaturase Protein Motif 2.

SEQ. ID. NO:64 sets forth oligonucleotide primer RO1119.

SEQ. ID. NO:65 sets forth oligonucleotide primer RO1118.

SEQ. ID. NO:66 sets forth consensus desaturase Protein Motif 3.

SEQ. ID. NO:67 sets forth oligonucleotide primer RO1121.

SEQ. ID. NO:68 sets forth oligonucleotide primer RO1122.

SEQ. ID. NO:69 sets forth consensus desaturase Protein Motif 4.

SEQ. ID. NO:70 sets forth oligonucleotide primer RO1146.

SEQ. ID. NO:71 sets forth oligonucleotide primer RO1147.

SEQ. ID. NO:72 sets forth consensus desaturase Protein Motif 5.

SEQ. ID. NO:73 sets forth oligonucleotide primer RO1148.

SEQ. ID. NO:74 sets forth consensus desaturase Protein Motif 6.

SEQ. ID. NO:75 sets forth oligonucleotide primer RO1114.

SEQ. ID. NO:76 sets forth consensus desaturase Protein Motif 7.

SEQ. ID. NO:77 sets forth oligonucleotide primer RO1116.

SEQ. ID. NO:78 sets forth consensus desaturase Protein Motif 8.

SEQ. ID. NO:80 sets forth oligonucleotide primer RO1189.

SEQ. ID. NO:81 sets forth oligonucleotide primer RO1190.

SEQ. ID. NO:82 sets forth oligonucleotide primer RO1191.

SEQ. ID. NO:83 sets forth oligonucleotide primer RO898.

SEQ. ID. NO:84 sets forth oligonucleotide primer RO899.

SEQ. ID. NO:85 sets forth oligonucleotide primer RO1185.

SEQ. ID. NO:86 sets forth oligonucleotide primer RO1186.

SEQ. ID. NO:87 sets forth oligonucleotide primer RO1187.

SEQ. ID. NO:88 sets forth oligonucleotide primer RO1212.

SEQ. ID. NO:89 sets forth oligonucleotide primer RO1213.

SEQ. ID. NO:90 sets forth the sequence of the expression cassette that
comprises the constitutive soybean S-adenosylmethionine synthetase (SAMS)
promoter operably linked to a gene for a form of soybean acetolactate synthase
(ALS) that is capable of conferring resistance to sulfonylurea herbicides.

SEQ. ID. NO:91 sets forth oligonucleotide primer oSBD30-1.

SEQ. ID. NO:92 sets forth oligonucleotide primer oSBD30-2.

SEQ. ID. NO:93 sets forth oligonucleotide primer T7pro.

SEQ. ID. NO:94 sets forth oligonucleotide primer RO1327.

SEQ. ID. NO:95 sets forth oligonucleotide primer GenRacer3'.

SEQ. ID. NO:96 sets forth oligonucleotide primer oCal-26.

SEQ. ID. NO:97 sets forth oligonucleotide primer oCal-27.

SEQ. ID. NO:98 sets forth oligonucleotide primer oKTi7.

DETAILED DESCRIPTION OF THE INVENTION 
All patents, patent applications, and publications cited are incorporated herein 
by reference in their entirety. 

In the context of this disclosure, a number of terms shall be utilized. 

Fatty acids are described herein by a numbering system in which the number
before the colon indicates the number of carbon atoms in the fatty acid, whereas the
number after the colon is the number of double bonds that are present. The number
following the fatty acid designation indicates the position of the double bond from
the carboxyl end of the fatty acid with the "c" affix for the cis-configuration of the
double bond, e.g., palmitic acid (16:0), stearic acid (18:0), oleic acid (18:1,9c),
petroselinic acid (18:1,6c), linoleic acid (18:2,9c,12c), γ-linolenic acid
(18:3,6c,9c,12c) and α-linolenic acid (18:3,9c,12c,15c). Unless otherwise specified, 18:1,
18:2 and 18:3 refer to oleic, linoleic and linolenic fatty acids.
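
The numbering system described above can be read mechanically, as in the following Python sketch. The function name and the dictionary layout are hypothetical; only the "c" affix mentioned above is recognized, and any other affix is rejected.

```python
# Illustrative sketch only: parses designations such as "18:2,9c,12c".
import re


def parse_fatty_acid(designation):
    """Split a designation into carbon count, double bond count and positions.

    Positions are counted from the carboxyl end; each is returned as
    (position, affix), where affix is "c" for cis or None when absent.
    """
    head, *tail = [part.strip() for part in designation.split(",")]
    carbons_text, bonds_text = head.split(":")
    carbons, double_bonds = int(carbons_text), int(bonds_text)

    positions = []
    for part in tail:
        match = re.fullmatch(r"(\d+)(c?)", part)
        if match is None:
            raise ValueError(f"unrecognized position: {part!r}")
        positions.append((int(match.group(1)), match.group(2) or None))

    # When positions are listed, there must be one per double bond.
    if positions and len(positions) != double_bonds:
        raise ValueError("number of positions does not match double bond count")

    return {"carbons": carbons, "double_bonds": double_bonds, "positions": positions}


if __name__ == "__main__":
    print(parse_fatty_acid("18:2,9c,12c"))
```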

"Omega-3 fatty acid" (also referred to as an n-3 fatty acid) includes the 
essential fatty acid α-linolenic acid (18:3n-3) (ALA) and its long-chain metabolites.
In n-3 fatty acids, the first double bond is located at the third carbon from the methyl
end of the hydrocarbon chain. For n-6 fatty acids, it is located at the sixth carbon.
Eicosapentaenoic acid (EPA), docosapentaenoic acid (DPA), and
docosahexaenoic acid (DHA) are examples of omega-3 fatty acids.
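
The relation between carboxyl-end numbering and the n-3/n-6 classification can be sketched as follows: the double bond nearest the methyl end is the one with the highest carboxyl-end position, so its distance from the methyl end is the chain length minus that position. The function name is hypothetical, and the double bond positions given for EPA, DHA and arachidonic acid are the standard ones, supplied here as an assumption rather than taken from this disclosure.

```python
# Illustrative sketch only: derives the n-x (omega) class of a fatty acid
# from its chain length and carboxyl-end double bond positions.

def omega_class(carbons, delta_positions):
    """Return x for an n-x fatty acid, or None for a saturated fatty acid."""
    if not delta_positions:
        return None
    return carbons - max(delta_positions)


if __name__ == "__main__":
    examples = {
        "alpha-linolenic acid": (18, [9, 12, 15]),
        "linoleic acid": (18, [9, 12]),
        "EPA": (20, [5, 8, 11, 14, 17]),              # standard positions (assumed)
        "DHA": (22, [4, 7, 10, 13, 16, 19]),          # standard positions (assumed)
        "arachidonic acid": (20, [5, 8, 11, 14]),     # standard positions (assumed)
    }
    for name, (carbons, positions) in examples.items():
        print(f"{name}: n-{omega_class(carbons, positions)}")
```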

"Desaturase" is a polypeptide which can desaturate one or more fatty acids to
produce a mono- or poly-unsaturated fatty acid or precursor which is of interest.

A "food analog" is a food-like product manufactured to resemble its food 
counterpart, whether meat, cheese, milk or the like, and is intended to have the 
appearance, taste, and texture of its counterpart.

"Aquaculture feed" refers to feed used in aquafarming which concerns the
propagation, cultivation or farming of aquatic organisms, animals and/or plants in
fresh or marine waters.

The terms "polynucleotide", "polynucleotide sequence", "nucleic acid
sequence", and "nucleic acid fragment"/"isolated nucleic acid fragment" are used
interchangeably herein. These terms encompass nucleotide sequences and the
like. A polynucleotide may be a polymer of RNA or DNA that is single- or
double-stranded, that optionally contains synthetic, non-natural or altered nucleotide bases.
A polynucleotide in the form of a polymer of DNA may be comprised of one or more
segments of cDNA, genomic DNA, synthetic DNA, or mixtures thereof. Nucleotides
(usually found in their 5'-monophosphate form) are referred to by a single letter
designation as follows: "A" for adenylate or deoxyadenylate (for RNA or DNA,
respectively), "C" for cytidylate or deoxycytidylate, "G" for guanylate or
deoxyguanylate, "U" for uridylate, "T" for deoxythymidylate, "R" for purines (A or G),
"Y" for pyrimidines (C or T), "K" for G or T, "H" for A or C or T, "I" for inosine, and "N"
for any nucleotide.
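
The single letter designations listed above can be used to enumerate the concrete DNA sequences represented by a degenerate oligonucleotide, as in the following Python sketch. The function name and the example sequence are hypothetical; "N" is taken to mean A, C, G or T for DNA, and inosine ("I") and uridylate ("U") are deliberately left unsupported in this sketch.

```python
# Illustrative sketch only: expands a degenerate DNA sequence using the
# single letter designations listed above.
from itertools import product

AMBIGUITY = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG",    # purines
    "Y": "CT",    # pyrimidines
    "K": "GT",
    "H": "ACT",
    "N": "ACGT",  # any nucleotide (DNA assumed)
}


def expand_degenerate(sequence):
    """Return every concrete DNA sequence matching a degenerate sequence."""
    try:
        choices = [AMBIGUITY[base] for base in sequence.upper()]
    except KeyError as err:
        raise ValueError(f"unsupported code: {err.args[0]}") from None
    return ["".join(combo) for combo in product(*choices)]


if __name__ == "__main__":
    # "ARY" is a made-up example, not a primer from this disclosure.
    print(expand_degenerate("ARY"))
```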

The terms "subfragment that is functionally equivalent" and "functionally
equivalent subfragment" are used interchangeably herein. These terms refer to a
portion or subsequence of an isolated nucleic acid fragment in which the ability to
alter gene expression or produce a certain phenotype is retained whether or not the
fragment or subfragment encodes an active enzyme. For example, the fragment or
subfragment can be used in the design of chimeric genes to produce the desired
phenotype in a transformed plant. Chimeric genes can be designed for use in
suppression by linking a nucleic acid fragment or subfragment thereof, whether or
not it encodes an active enzyme, in the sense or antisense orientation relative to a 
plant promoter sequence. 

The terms "homology", "homologous", "substantially similar" and
"corresponding substantially" are used interchangeably herein. They refer to nucleic
acid fragments wherein changes in one or more nucleotide bases do not affect the
ability of the nucleic acid fragment to mediate gene expression or produce a certain
phenotype. These terms also refer to modifications of the nucleic acid fragments of
the instant invention such as deletion or insertion of one or more nucleotides that do
not substantially alter the functional properties of the resulting nucleic acid fragment
relative to the initial, unmodified fragment. It is therefore understood, as those 
skilled in the art will appreciate, that the invention encompasses more than the 
specific exemplary sequences. 

Moreover, the skilled artisan recognizes that substantially similar nucleic acid
sequences encompassed by this invention are also defined by their ability to
hybridize, under moderately stringent conditions (for example, 0.5X SSC, 0.1%
SDS, 60°C) with the sequences exemplified herein, or to any portion of the
nucleotide sequences disclosed herein and which are functionally equivalent to any
of the nucleic acid sequences disclosed herein. Stringency conditions can be
adjusted to screen for moderately similar fragments, such as homologous
sequences from distantly related organisms, to highly similar fragments, such as
genes that duplicate functional enzymes from closely related organisms.
Post-hybridization washes determine stringency conditions. One set of preferred
conditions involves a series of washes starting with 6X SSC, 0.5% SDS at room
temperature for 15 min, then repeated with 2X SSC, 0.5% SDS at 45°C for 30 min,
and then repeated twice with 0.2X SSC, 0.5% SDS at 50°C for 30 min. A more
preferred set of stringent conditions involves the use of higher temperatures in
which the washes are identical to those above except that the temperature of the final
two 30 min washes in 0.2X SSC, 0.5% SDS is increased to 60°C. Another
preferred set of highly stringent conditions involves the use of two final washes in
0.1X SSC, 0.1% SDS at 65°C.
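
The final washes of the three sets of conditions described above can be compared as sketched below. The comparison rule (a wash is treated as more stringent when it uses no more salt and no lower temperature, with at least one of the two strictly changed) is a general rule of thumb assumed for illustration; the class, constant and function names are hypothetical.

```python
# Illustrative sketch only: encodes the final washes recited above and
# compares them by salt concentration (SSC) and temperature.
from dataclasses import dataclass


@dataclass(frozen=True)
class FinalWash:
    label: str
    ssc_x: float        # SSC concentration, e.g. 0.2 for 0.2X SSC
    sds_percent: float  # SDS concentration in percent
    temp_c: float       # wash temperature in degrees Celsius


PREFERRED = FinalWash("preferred", 0.2, 0.5, 50.0)
MORE_PREFERRED = FinalWash("more preferred", 0.2, 0.5, 60.0)
HIGHLY_STRINGENT = FinalWash("highly stringent", 0.1, 0.1, 65.0)


def is_more_stringent(a, b):
    """True if wash a is more stringent than wash b under the assumed rule."""
    not_weaker = a.ssc_x <= b.ssc_x and a.temp_c >= b.temp_c
    strictly_stronger = a.ssc_x < b.ssc_x or a.temp_c > b.temp_c
    return not_weaker and strictly_stronger


if __name__ == "__main__":
    print(is_more_stringent(MORE_PREFERRED, PREFERRED))
    print(is_more_stringent(HIGHLY_STRINGENT, MORE_PREFERRED))
```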

"Gene" refers to a nucleic acid fragment that expresses a specific protein, 

including regulatory sequences preceding (5' non-coding sequences) and following
(3' non-coding sequences) the coding sequence. "Native gene" refers to a gene as
found in nature with its own regulatory sequences. "Chimeric gene" refers to any gene
that is not a native gene, comprising regulatory and coding sequences that are not
found together in nature. Accordingly, a chimeric gene may comprise regulatory
sequences and coding sequences that are derived from different sources, or
regulatory sequences and coding sequences derived from the same source, but
arranged in a manner different than that found in nature. A "foreign" gene refers to
a gene not normally found in the host organism, but that is introduced into the host
organism by gene transfer. Foreign genes can comprise native genes inserted into
a non-native organism, or chimeric genes. A "transgene" is a gene that has been
introduced into the genome by a transformation procedure. An "allele" is one of
several alternative forms of a gene occupying a given locus on a chromosome.
When all the alleles present at a given locus on a chromosome are the same, that
plant is homozygous at that locus. If the alleles present at a given locus on a
chromosome differ, that plant is heterozygous at that locus.

"Coding sequence" refers to a DNA sequence that codes for a specific amino 
acid sequence. "Regulatory sequences" refer to nucleotide sequences located 
upstream (5' non-coding sequences), within, or downstream (3' non-coding 
sequences) of a coding sequence, and which influence the transcription, RNA 

processing or stability, or translation of the associated coding sequence. Regulatory
sequences may include, but are not limited to, promoters, translation leader
sequences, introns, and polyadenylation recognition sequences.

"Promoter" refers to a DNA sequence capable of controlling the expression of
a coding sequence or functional RNA. The promoter sequence consists of proximal
and more distal upstream elements, the latter elements often referred to as
enhancers. Accordingly, an "enhancer" is a DNA sequence that can stimulate
promoter activity, and may be an innate element of the promoter or a heterologous
element inserted to enhance the level or tissue-specificity of a promoter. Promoters
may be derived in their entirety from a native gene, or be composed of different
elements derived from different promoters found in nature, or even comprise
synthetic DNA segments. It is understood by those skilled in the art that different
promoters may direct the expression of a gene in different tissues or cell types, or at
different stages of development, or in response to different environmental
conditions. It is further recognized that since in most cases the exact boundaries of
regulatory sequences have not been completely defined, DNA fragments of some
variation may have identical promoter activity. Promoters that cause a gene to be
expressed in most cell types at most times are commonly referred to as "constitutive
promoters". New promoters of various types useful in plant cells are constantly
being discovered; numerous examples may be found in the compilation by
Okamuro, J. K., and Goldberg, R. B. (1989) Biochemistry of Plants 15:1-82.

The "translation leader sequence" refers to a polynucleotide sequence
located between the promoter sequence of a gene and the coding sequence. The
translation leader sequence is present in the fully processed mRNA upstream of the
translation start sequence. The translation leader sequence may affect processing
of the primary transcript to mRNA, mRNA stability or translation efficiency.
Examples of translation leader sequences have been described (Turner, R. and
Foster, G. D. (1995) Mol. Biotechnol. 3:225-236).

The "3' non-coding sequences" or "transcription terminator/termination
sequences" refer to DNA sequences located downstream of a coding sequence and
include polyadenylation recognition sequences and other sequences encoding
regulatory signals capable of affecting mRNA processing or gene expression. The
polyadenylation signal is usually characterized by affecting the addition of
polyadenylic acid tracts to the 3' end of the mRNA precursor. The use of different
3' non-coding sequences is exemplified by Ingelbrecht, I. L., et al. (1989) Plant Cell
1:671-680.

"RNA transcript" refers to the product resulting from RNA polymerase-catalyzed
transcription of a DNA sequence. When the RNA transcript is a perfect
complementary copy of the DNA sequence, it is referred to as the primary transcript.
An RNA transcript is referred to as the mature RNA when it is an RNA sequence
derived from post-transcriptional processing of the primary transcript. "Messenger
RNA (mRNA)" refers to the RNA that is without introns and that can be translated
into protein by the cell. "cDNA" refers to a DNA that is complementary to and
synthesized from a mRNA template using the enzyme reverse transcriptase. The
cDNA can be single-stranded or converted into the double-stranded form using the
Klenow fragment of DNA polymerase I. "Sense" RNA refers to RNA transcript that
includes the mRNA and can be translated into protein within a cell or in vitro.
"Antisense RNA" refers to an RNA transcript that is complementary to all or part of a
target primary transcript or mRNA, and that blocks the expression of a target gene
(U.S. Patent No. 5,107,065). The complementarity of an antisense RNA may be
with any part of the specific gene transcript, i.e., at the 5' non-coding sequence,
3' non-coding sequence, introns, or the coding sequence. "Functional RNA" refers
to antisense RNA, ribozyme RNA, or other RNA that may not be translated but yet
has an effect on cellular processes. The terms "complement" and "reverse
complement" are used interchangeably herein with respect to mRNA transcripts,
and are meant to define the antisense RNA of the message.
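
As an illustration of the "reverse complement" relationship between a message and its antisense RNA, the following is a minimal sketch in Python. The function names and the nine-nucleotide example sequence are hypothetical and are not taken from the specification; the sketch assumes sequences are written 5' to 3' using only the bases A, U, G and C.

```python
# Minimal sketch (assumed helper names): the antisense RNA of a message is
# modeled as the reverse complement of the mRNA sequence, both written 5'->3'.
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement_rna(mrna: str) -> str:
    """Return the reverse complement of an RNA sequence written 5'->3'."""
    return "".join(RNA_COMPLEMENT[base] for base in reversed(mrna.upper()))


def is_antisense_of(candidate: str, target: str) -> bool:
    """True if `candidate` is complementary to all or part of `target`."""
    return reverse_complement_rna(candidate) in target.upper()


example_mrna = "AUGGCUUAA"  # hypothetical message, for illustration only
example_antisense = reverse_complement_rna(example_mrna)
```

Under these assumptions, a fragment such as `GCCAU` would count as antisense to the example message because its reverse complement, `AUGGC`, occurs within it; this mirrors the statement above that complementarity may be with any part of the transcript.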

The term "operably linked" refers to the association of nucleic acid sequences
on a single nucleic acid fragment so that the function of one is regulated by the
other. For example, a promoter is operably linked with a coding sequence when it is
capable of regulating the expression of that coding sequence (i.e., that the coding
sequence is under the transcriptional control of the promoter). Coding sequences
can be operably linked to regulatory sequences in a sense or antisense orientation.
In another example, the complementary RNA regions of the invention can be
operably linked, either directly or indirectly, 5' to the target mRNA, or 3' to the target
mRNA, or within the target mRNA, or a first complementary region is 5' and its
complement is 3' to the target mRNA.

Standard recombinant DNA and molecular cloning techniques used herein 
are well known in the art and are described more fully in Sambrook, J., Fritsch, E.F.
and Maniatis, T. Molecular Cloning: A Laboratory Manual; Cold Spring Harbor 
Laboratory Press: Cold Spring Harbor, 1989. Transformation methods are well 
known to those skilled in the art and are described below. 

"PCR" or "Polymerase Chain Reaction" is a technique for the synthesis of 
large quantities of specific DNA segments and consists of a series of repetitive cycles
(Perkin Elmer Cetus Instruments, Norwalk, CT). Typically, the double-stranded
DNA is heat denatured, the two primers complementary to the 3' boundaries of the
target segment are annealed at low temperature and then extended at an
intermediate temperature. One set of these three consecutive steps is referred to
as a cycle. 
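
To show why a series of repetitive cycles yields large quantities of a DNA segment, the following minimal Python sketch models idealized amplification. The function name, the per-cycle `efficiency` parameter, and the assumption that every cycle behaves identically are illustrative simplifications and are not part of the specification.

```python
# Minimal sketch: idealized PCR amplification. Each cycle consists of the
# three steps named in the text; temperatures are deliberately left unspecified.
CYCLE_STEPS = ("denature", "anneal primers", "extend")


def pcr_copies(initial_copies: float, cycles: int, efficiency: float = 1.0) -> float:
    """Expected copy number after `cycles` cycles.

    `efficiency` is the assumed fraction of templates duplicated per cycle
    (1.0 means perfect doubling); it is a modeling assumption, not a measured value.
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles
```

With perfect doubling, a single template would give 2^30 (about one billion) copies after 30 cycles; real reactions plateau earlier, which this sketch does not attempt to model.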

The term "recombinant" refers to an artificial combination of two otherwise
separated segments of sequence, e.g., by chemical synthesis or by the 
manipulation of isolated segments of nucleic acids by genetic engineering 
techniques. 

The terms "recombinant construct", "expression construct", "chimeric
construct", "construct", and "recombinant DNA construct" are used interchangeably
herein. A recombinant construct comprises an artificial combination of nucleic acid
fragments, e.g., regulatory and coding sequences that are not found together in
nature. For example, a chimeric construct may comprise regulatory sequences and
coding sequences that are derived from different sources, or regulatory sequences
and coding sequences derived from the same source, but arranged in a manner
different than that found in nature. Such a construct may be used by itself or may be
used in conjunction with a vector. If a vector is used then the choice of vector is
dependent upon the method that will be used to transform host cells as is well
known to those skilled in the art. For example, a plasmid vector can be used. The
skilled artisan is well aware of the genetic elements that must be present on the
vector in order to successfully transform, select and propagate host cells comprising
any of the isolated nucleic acid fragments of the invention. The skilled artisan will
also recognize that different independent transformation events will result in different
levels and patterns of expression (Jones et al., (1985) EMBO J. 4:2411-2418;
De Almeida et al., (1989) Mol. Gen. Genetics 218:78-86), and thus that multiple
events must be screened in order to obtain lines displaying the desired expression
level and pattern. Such screening may be accomplished by Southern analysis of
DNA, Northern analysis of mRNA expression, immunoblotting analysis of protein
expression, or phenotypic analysis, among others.

The term "expression", as used herein, refers to the production of a functional 
end-product, e.g., a mRNA or a protein (precursor or mature).

The term "expression cassette" as used herein, refers to a discrete nucleic 
acid fragment into which a nucleic acid sequence or fragment can be moved. 

"Mature" protein refers to a post-translationally processed polypeptide; i.e.,
one from which any pre- or propeptides present in the primary translation product
have been removed. "Precursor" protein refers to the primary product of translation
of mRNA; i.e., with pre- and propeptides still present. Pre- and propeptides may be,
but are not limited to, intracellular localization signals.

"Stable transformation" refers to the transfer of a nucleic acid fragment into a
genome of a host organism, including both nuclear and organellar genomes,
resulting in genetically stable inheritance. In contrast, "transient transformation"
refers to the transfer of a nucleic acid fragment into the nucleus, or DNA-containing
organelle, of a host organism resulting in gene expression without integration or
stable inheritance. Host organisms containing the transformed nucleic acid
fragments are referred to as "transgenic" organisms.

"Antisense inhibition" refers to the production of antisense RNA transcripts 
capable of suppressing the expression of the target protein. "Co-suppression" 
refers to the production of sense RNA transcripts capable of suppressing the 
expression of identical or substantially similar foreign or endogenous genes 
(U.S. Patent No. 5,231,020). Co-suppression constructs in plants previously have
been designed by focusing on overexpression of a nucleic acid sequence having
homology to an endogenous mRNA, in the sense orientation, which results in the
reduction of all RNA having homology to the overexpressed sequence (see
Vaucheret et al. (1998) Plant J. 16:651-659; and Gura (2000) Nature 404:804-808).
The overall efficiency of this phenomenon is low, and the extent of the RNA
reduction is widely variable. Recent work has described the use of "hairpin"
structures that incorporate all, or part, of an mRNA encoding sequence in a
complementary orientation that results in a potential "stem-loop" structure for the
expressed RNA (PCT Publication WO 99/53050 published on October 21, 1999 and
more recently, Applicants' assignee's PCT Application having international
publication number WO 02/00904 published on January 3, 2002). This increases
the frequency of co-suppression in the recovered transgenic plants. Another
variation describes the use of plant viral sequences to direct the suppression, or
"silencing", of proximal mRNA encoding sequences (PCT Publication WO 98/36083
published on August 20, 1998). Neither of these co-suppressing phenomena has
been elucidated mechanistically, although genetic evidence has begun to unravel
this complex situation (Elmayan et al. (1998) Plant Cell 10:1747-1757).

The polynucleotide sequences used for suppression do not necessarily have 
to be 100% complementary to the polynucleotide sequences found in the gene to be 
suppressed. For example, suppression of all the subunits of the soybean seed
storage protein β-conglycinin has been accomplished using a polynucleotide
derived from a portion of the gene encoding the α subunit (U.S. Patent
No. 6,362,399). β-conglycinin is a heterogeneous glycoprotein composed of varying
combinations of three highly negatively charged subunits identified as α, α' and β.
The polynucleotide sequences encoding the α and α' subunits are 85% identical to
each other while the polynucleotide sequences encoding the β subunit are 75 to
80% identical to the α and α' subunits. Thus, polynucleotides that are at least 75%
identical to a region of the polynucleotide that is the target for suppression have been
shown to be effective in suppressing the desired target. The polynucleotide should
be at least 80% identical, preferably at least 90% identical, most preferably at least
95% identical, or the polynucleotide may be 100% identical to the desired target.
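
The identity thresholds above can be made concrete with a short Python sketch. It assumes the two sequences have already been aligned to equal length without gaps, which is a simplification; the function names and the tier labels are hypothetical, and the specification does not prescribe any particular alignment method.

```python
# Minimal sketch: percent identity over a pre-aligned, ungapped region,
# mapped onto the identity tiers named in the text (75, 80, 90, 95, 100 %).
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of positions at which two equal-length sequences match."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def identity_tier(pct: float) -> str:
    """Return the highest tier from the text that `pct` satisfies."""
    tiers = (
        (100.0, "100% identical"),
        (95.0, "at least 95% identical"),
        (90.0, "at least 90% identical"),
        (80.0, "at least 80% identical"),
        (75.0, "at least 75% identical"),
    )
    for threshold, label in tiers:
        if pct >= threshold:
            return label
    return "below 75% identical"
```

For instance, two ten-nucleotide regions that differ at a single position are 90% identical and would fall in the "preferably at least 90% identical" tier.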

The present invention concerns an oilseed plant that produces mature seeds
in which the total seed fatty acid profile comprises at least 1.0 % of at least one
polyunsaturated fatty acid having at least twenty carbon atoms and five or more
carbon-carbon double bonds.

In a second embodiment, this invention concerns an oilseed plant that 
produces mature seeds in which the total seed fatty acid profile comprises at least
5.0 % of at least one polyunsaturated fatty acid having at least twenty carbon atoms
and five or more carbon-carbon double bonds.

In a third embodiment, this invention concerns an oilseed plant that produces 
mature seeds in which the total seed fatty acid profile comprises at least 10.0 % of 
at least one polyunsaturated fatty acid having at least twenty carbon atoms and five 
or more carbon-carbon double bonds. 

Additional embodiments of this invention include an oilseed plant that
produces mature seeds in which the total seed fatty acid profile comprises at least
15.0 %, 20 %, 25 %, 30 %, 40 %, 50 %, or 60 % of at least one polyunsaturated
fatty acid having at least twenty carbon atoms and five or more carbon-carbon
double bonds. Indeed, one might expect that any integer level of accumulation of at
least one polyunsaturated fatty acid from about 1 % to about 60 % of the total seed
fatty acid profile could be obtained.

In a fourth embodiment, this invention concerns an oilseed plant that 
produces mature seeds in which the total seed fatty acid profile comprises at least 
10.0 % of at least one polyunsaturated fatty acid having at least twenty carbon 
atoms and five or more carbon-carbon double bonds and less than 2.0% 
arachidonic acid. 

Again, additional embodiments would include an oilseed plant that produces
mature seeds in which the total seed fatty acid profile comprises at least 15.0 %,
20 %, 25 %, 30 %, 40 %, 50 %, or 60 % of at least one polyunsaturated fatty acid
having at least twenty carbon atoms and five or more carbon-carbon double bonds
and less than 2.0% arachidonic acid. Indeed, one might expect that any integer level
of accumulation of at least one polyunsaturated fatty acid from about 1 % to about
60 % of the total seed fatty acid profile could be obtained while accumulating less
than 2 % arachidonic acid.
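
The embodiments above amount to threshold tests on a seed fatty acid profile. The following Python sketch shows one way such a test might be expressed; the class, the function names and any profile values passed to it are hypothetical and are not data from the specification. Only the chain lengths and double-bond counts of EPA (20:5), DPA (22:5), DHA (22:6) and ARA (20:4) are standard values.

```python
# Minimal sketch: check a seed fatty acid profile (percent of total fatty
# acids) against an embodiment of the form "at least X % of at least one
# PUFA with >= 20 carbons and >= 5 double bonds, and less than Y % ARA".
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class FattyAcid:
    name: str
    carbons: int
    double_bonds: int


EPA = FattyAcid("EPA", 20, 5)
DPA = FattyAcid("DPA", 22, 5)
DHA = FattyAcid("DHA", 22, 6)
ARA = FattyAcid("ARA", 20, 4)


def is_target_pufa(fa: FattyAcid) -> bool:
    """At least twenty carbon atoms and five or more double bonds."""
    return fa.carbons >= 20 and fa.double_bonds >= 5


def meets_embodiment(
    profile: Dict[FattyAcid, float],
    min_pufa_pct: float,
    max_ara_pct: Optional[float] = None,
) -> bool:
    """True if at least one target PUFA reaches `min_pufa_pct` and, when
    `max_ara_pct` is given, arachidonic acid stays below that limit."""
    has_target = any(
        is_target_pufa(fa) and pct >= min_pufa_pct for fa, pct in profile.items()
    )
    if max_ara_pct is None:
        return has_target
    return has_target and profile.get(ARA, 0.0) < max_ara_pct
```

A hypothetical profile with 12 % EPA and 1.5 % ARA would satisfy the fourth embodiment (`meets_embodiment(profile, 10.0, 2.0)`) but not a 15 % threshold.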

Examples of oilseed plants include, but are not limited to, soybean, Brassica
species, sunflower, maize, cotton, flax, and safflower.

Examples of polyunsaturated fatty acids having at least twenty carbon atoms 
and five or more carbon-carbon double bonds include, but are not limited to, omega-3
fatty acids such as EPA, DPA and DHA. Seeds obtained from such plants are
also within the scope of this invention as well as oil obtained from such seeds. 

In a fifth embodiment this invention concerns a recombinant construct for 
altering the total fatty acid profile of mature seeds of an oilseed plant, said construct 
comprising at least two promoters wherein each promoter is operably linked to a 
nucleic acid sequence encoding a polypeptide required for making at least one 
polyunsaturated fatty acid having at least twenty carbon atoms and four or more 
carbon-carbon double bonds and further wherein the total fatty acid profile 
comprises at least 2 % of at least one polyunsaturated fatty acid having at least 
twenty carbon atoms and four or more carbon-carbon double bonds and further 
wherein said polypeptide is an enzyme selected from the group consisting of a
Δ4 desaturase, a Δ5 desaturase, a Δ6 desaturase, a Δ15 desaturase, a
Δ17 desaturase, a C18 to C22 elongase and a C20 to C24 elongase.

Such desaturases are discussed in U.S. Patent Nos. 6,075,183, 5,968,809, 
6,136,574, 5,972,664, 6,051,754, 6,410,288 and WO 98/46763, WO 98/46764, 
WO 00/12720, and WO 00/40705.

The choice of combination of cassettes used depends in part on the PUFA
profile and/or desaturase profile of the oilseed plant cells to be transformed and the 
LC-PUFA which is to be expressed. 

A number of enzymes are involved in PUFA biosynthesis. Linoleic acid (LA,
18:2 Δ9,12) is produced from oleic acid (18:1 Δ9) by a delta-12 desaturase. GLA
(18:3 Δ6,9,12) is produced from linoleic acid (18:2 Δ9,12) by a delta-6 desaturase.
ARA (20:4 Δ5,8,11,14) production from dihomo-gamma-linolenic acid (DGLA, 20:3
Δ8,11,14) is catalyzed by a delta-5 desaturase. However, animals cannot
desaturate beyond the delta-9 position and therefore cannot convert oleic acid (18:1
Δ9) into linoleic acid (LA, 18:2 Δ9,12). Likewise, alpha-linolenic acid (ALA, 18:3
Δ9,12,15) cannot be synthesized by mammals. Other eukaryotes, including fungi and
plants, have enzymes which desaturate at positions delta-12 and delta-15. The major
poly-unsaturated fatty acids of animals therefore are either derived from diet and/or
from desaturation and elongation of linoleic acid (LA, 18:2 Δ9,12) or alpha-linolenic
acid (ALA, 18:3 Δ9,12,15).
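
The shorthand used above (carbons:double bonds, followed by the delta positions) lends itself to a small worked example. The Python sketch below traces oleic acid to ARA through the desaturation steps named in the text. The class and function names are hypothetical, and the elongation of GLA to DGLA is inferred from the two structures given above rather than stated explicitly; the sketch assumes elongation adds two carbons at the carboxyl end, so every delta position shifts by two.

```python
# Minimal sketch: fatty acids as (chain length, delta positions), with
# desaturation and two-carbon elongation as simple transformations.
from typing import NamedTuple, Tuple


class FattyAcid(NamedTuple):
    carbons: int
    double_bonds: Tuple[int, ...]  # delta positions, counted from the carboxyl end

    def shorthand(self) -> str:
        positions = ",".join(str(p) for p in self.double_bonds)
        return f"{self.carbons}:{len(self.double_bonds)} Δ{positions}"


def desaturate(fa: FattyAcid, delta: int) -> FattyAcid:
    """Introduce a double bond at the given delta position."""
    if delta in fa.double_bonds or not 1 <= delta < fa.carbons:
        raise ValueError(f"cannot desaturate at delta-{delta}")
    return FattyAcid(fa.carbons, tuple(sorted(fa.double_bonds + (delta,))))


def elongate(fa: FattyAcid) -> FattyAcid:
    """Add two carbons at the carboxyl end; delta positions shift by two."""
    return FattyAcid(fa.carbons + 2, tuple(p + 2 for p in fa.double_bonds))


oleic = FattyAcid(18, (9,))
la = desaturate(oleic, 12)   # delta-12 desaturase
gla = desaturate(la, 6)      # delta-6 desaturase
dgla = elongate(gla)         # elongation step (inferred, see lead-in)
ara = desaturate(dgla, 5)    # delta-5 desaturase
```

Calling `shorthand()` on each intermediate reproduces the notation used in the paragraph above, which is a quick consistency check on the pathway as written.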

The elongation process in plants involves a four-step process initiated by the 
crucial step of condensation of malonate and a fatty acid with release of a carbon 
dioxide molecule. The substrates in fatty acid elongation are CoA thioesters. The 
condensation step is mediated by a 3-ketoacyl synthase, which is generally rate 
limiting to the overall cycle of four reactions and provides some substrate specificity. 
The product of one elongation cycle regenerates a fatty acid that has been extended
by two carbon atoms (Browse et al., Trends in Biochemical Sciences 27(9): 467-473
(September 2002); Napier, Trends in Plant Sciences 7(2): 51-54 (February 2002)). 

As was noted above, a promoter is a DNA sequence that directs cellular
machinery of a plant to produce RNA from the contiguous coding sequence
downstream (3') of the promoter. The promoter region influences the rate,
developmental stage, and cell type in which the RNA transcript of the gene is made.
The RNA transcript is processed to produce messenger RNA (mRNA) which serves
as a template for translation of the RNA sequence into the amino acid sequence of
the encoded polypeptide. The 5' non-translated leader sequence is a region of the
mRNA upstream of the protein coding region that may play a role in initiation and
translation of the mRNA. The 3' transcription termination/polyadenylation signal is a
non-translated region downstream of the protein coding region that functions in the
plant cells to cause termination of the RNA transcript and the addition of
polyadenylate nucleotides to the 3' end of the RNA.

The origin of the promoter chosen to drive expression of the coding sequence
is not important as long as it has sufficient transcriptional activity to accomplish the
invention by expressing translatable mRNA for the desired nucleic acid fragments in 
the desired host tissue at the right time. Either heterologous or non-heterologous 
(i.e., endogenous) promoters can be used to practice the invention. 

Suitable promoters which can be used to practice the invention include, but
are not limited to, the alpha prime subunit of beta conglycinin promoter, Kunitz
trypsin inhibitor 3 promoter, annexin promoter, Gly1 promoter, beta subunit of beta
conglycinin promoter, P34/Gly Bd m 30K promoter, albumin promoter, Leg A1
promoter and Leg A2 promoter. The level of activity of the annexin, or P34,
promoter is comparable to that of many known strong promoters, such as the CaMV
35S promoter (Atanassova et al., (1998) Plant Mol. Biol. 37:275-285; Battraw and
Hall, (1990) Plant Mol. Biol. 15:527-538; Holtorf et al., (1995) Plant Mol. Biol.
29:637-646; Jefferson et al., (1987) EMBO J. 6:3901-3907; Wilmink et al., (1995)
Plant Mol. Biol. 28:949-955), the Arabidopsis oleosin promoters (Plant et al., (1994)
Plant Mol. Biol. 25:193-205; Li, (1997) Texas A&M University Ph.D. dissertation,
pp. 107-128), the Arabidopsis ubiquitin extension protein promoters (Callis et al.,
1990), a tomato ubiquitin gene promoter (Rollfinke et al., 1998), a soybean heat
shock protein promoter (Schoffl et al., 1989), and a maize H3 histone gene promoter
(Atanassova et al., 1998).

Expression of chimeric genes in most plant cells makes the annexin, or P34,
promoter, which constitutes the subject matter of Applicants' Assignee's copending
application having Application No. 60/446,833 and Attorney Docket No. BB-1531
which is filed concurrently herewith, especially useful when seed specific expression

WO 2004/071467 PCT/US2004/005758

of a target heterologous nucleic acid fragment is required. Another useful feature of
the annexin promoter is its expression profile in developing seeds. The annexin
promoter of the invention is most active in developing seeds at early stages (before
10 days after pollination) and is largely quiescent in later stages. The expression
profile of the annexin promoter is different from that of many seed-specific
promoters, e.g., seed storage protein promoters, which often provide highest activity
in later stages of development (Chen et al., (1989) Dev. Genet. 10:112-122;
Ellerstrom et al., (1996) Plant Mol. Biol. 32:1019-1027; Keddie et al., (1994) Plant
Mol. Biol. 24:327-340; Plant et al., (1994) Plant Mol. Biol. 25:193-205; Li, (1997)
Texas A&M University Ph.D. dissertation, pp. 107-128). The P34 promoter has a
more conventional expression profile but remains distinct from other known seed
specific promoters. Thus, the annexin, or P34, promoter will be a very attractive
candidate when overexpression, or suppression, of a gene in embryos is desired at
an early developing stage. For example, it may be desirable to overexpress a gene
regulating early embryo development or a gene involved in the metabolism prior to
seed maturation.

The promoter is then operably linked in a sense orientation using 
conventional means well known to those skilled in the art. 

Once the recombinant construct has been made, it may then be introduced
into the oilseed plant cell of choice by methods well known to those of ordinary skill
in the art including, for example, transfection, transformation and electroporation as 
described above. The transformed plant cell is then cultured and regenerated under 
suitable conditions permitting expression of the LC-PUFA which is then recovered 
and purified. 

The recombinant constructs of the invention may be introduced into one plant
cell or, alternatively, each construct may be introduced into separate plant cells.

Expression in a plant cell may be accomplished in a transient or stable 
fashion as is described above. 

The desired LC-PUFAs can be expressed in seed. Also within the scope of 
this invention are seeds or plant parts obtained from such transformed plants.

Plant parts include differentiated and undifferentiated tissues, including but 
not limited to, roots, stems, shoots, leaves, pollen, seeds, tumor tissue, and various 
forms of cells and culture such as single cells, protoplasts, embryos, and callus 
tissue. The plant tissue may be in plant or in organ, tissue or cell culture. 

Methods for transforming dicots, primarily by use of Agrobacterium
tumefaciens, and obtaining transgenic plants have been published, among others,
for cotton (U.S. Patent No. 5,004,863, U.S. Patent No. 5,159,135); soybean
(U.S. Patent No. 5,569,834, U.S. Patent No. 5,416,011); Brassica (U.S. Patent
No. 5,463,174); peanut (Cheng et al. (1996) Plant Cell Rep. 15:653-657, McKently
et al. (1995) Plant Cell Rep. 14:699-703); papaya (Ling, K. et al. (1991)
Bio/technology 9:752-758); and pea (Grant et al. (1995) Plant Cell Rep.
15:254-258). For a review of other commonly used methods of plant transformation
see Newell, C.A. (2000) Mol. Biotechnol. 16:53-65. One of these methods of
transformation uses Agrobacterium rhizogenes (Tepfler and Casse-Delbart, F.
(1987) Microbiol. Sci. 4:24-28). Transformation of soybeans using direct delivery of
DNA has been published using PEG fusion (PCT publication WO 92/17598),
electroporation (Chowrira, G.M. et al. (1995) Mol. Biotechnol. 3:17-23; Christou, P.
et al. (1987) Proc. Natl. Acad. Sci. U.S.A. 84:3962-3966), microinjection, or particle
bombardment (McCabe, D.E. et al. (1988) Bio/Technology 6:923; Christou et al.
(1988) Plant Physiol. 87:671-674).

There are a variety of methods for the regeneration of plants from plant
tissue. The particular method of regeneration will depend on the starting plant tissue
and the particular plant species to be regenerated. The regeneration, development
and cultivation of plants from single plant protoplast transformants or from various
transformed explants is well known in the art (Weissbach and Weissbach, (1988)
In: Methods for Plant Molecular Biology, (Eds.), Academic Press, Inc., San Diego,
CA). This regeneration and growth process typically includes the steps of selecting
transformed cells and culturing those individualized cells through the usual stages of
embryonic development through the rooted plantlet stage. Transgenic embryos and
seeds are similarly regenerated. The resulting transgenic rooted shoots are
thereafter planted in an appropriate plant growth medium such as soil. Preferably,
the regenerated plants are self-pollinated to provide homozygous transgenic plants.
Otherwise, pollen obtained from the regenerated plants is crossed to seed-grown
plants of agronomically important lines. Conversely, pollen from plants of these
important lines is used to pollinate regenerated plants. A transgenic plant of the
present invention containing a desired polypeptide is cultivated using methods well
known to one skilled in the art.
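
To illustrate why self-pollination is used to obtain homozygous transgenic plants, the following Python sketch computes expected genotype fractions over successive selfed generations. It assumes a single transgene insertion locus, a hemizygous primary transformant, ordinary Mendelian segregation and no selection; these assumptions, and the function name, are illustrative and are not taken from the specification.

```python
# Minimal sketch: expected genotype fractions after selfing a plant that
# carries one copy of a transgene at a single locus (assumed model).
from fractions import Fraction
from typing import Dict


def selfing_genotype_fractions(generations: int) -> Dict[str, Fraction]:
    """Expected fractions of homozygous, hemizygous and null progeny after
    `generations` rounds of self-pollination, starting from a hemizygote."""
    if generations < 0:
        raise ValueError("generations must be non-negative")
    homozygous, hemizygous, null = Fraction(0), Fraction(1), Fraction(0)
    for _ in range(generations):
        # Each hemizygote segregates 1:2:1; homozygotes and nulls breed true.
        homozygous, hemizygous, null = (
            homozygous + hemizygous / 4,
            hemizygous / 2,
            null + hemizygous / 4,
        )
    return {"homozygous": homozygous, "hemizygous": hemizygous, "null": null}
```

Under this model one quarter of the first selfed generation is expected to be homozygous for the transgene, and the homozygous fraction rises to three eighths after a second round of selfing.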

In addition to the above discussed procedures, practitioners are familiar with
the standard resource materials which describe specific conditions and procedures
for the construction, manipulation and isolation of macromolecules (e.g., DNA
molecules, plasmids, etc.), generation of recombinant DNA fragments and
recombinant expression constructs and the screening and isolating of clones (see,
for example, Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, Cold
Spring Harbor Press; Maliga et al. (1995) Methods in Plant Molecular Biology, Cold
Spring Harbor Press; Birren et al. (1998) Genome Analysis: Detecting Genes, 1,
Cold Spring Harbor, New York; Birren et al. (1998) Genome Analysis: Analyzing
DNA, 2, Cold Spring Harbor, New York; Plant Molecular Biology: A Laboratory
Manual, eds. Clark, Springer, New York (1997)).

In another aspect, this invention concerns a method for making an oilseed
plant having an altered fatty acid profile which comprises:

a) transforming a plant with the recombinant construct of the invention;

b) growing the transformed plant of step (a); and

c) selecting those plants wherein the total fatty acid profile comprises at
least 1.0 % of at least one polyunsaturated fatty acid having at least twenty carbon
atoms and five or more carbon-carbon double bonds.

Methods of isolating seed oils are well known in the art (Young et al.,
Processing of Fats and Oils, in "The Lipid Handbook" (Gunstone et al. eds.)
Chapters pp 253-257; London, Chapman & Hall, 1994).

The altered seed oils can then be added to nutritional compositions such as a 
nutritional supplement, food products, infant formula, animal feed, pet food and the
like.

Compared to other vegetable oils, the oils of the invention are believed to 
function similarly to other oils in food applications from a physical standpoint.
Partially hydrogenated oils, such as soybean oil, are widely used as ingredients for 
soft spreads, margarine and shortenings for baking and frying. 

Examples of food products or food analogs into which altered seed oils or
altered seeds of the invention may be incorporated include a meat product such as
a processed meat product, a cereal food product, a snack food product, a baked
goods product, a fried food product, a health food product, an infant formula, a
beverage, a nutritional supplement, a dairy product, a pet food product, animal feed
or an aquaculture food product. Food analogs can be made using processes well
known to those skilled in the art. U.S. Patent Nos. 6,355,296 B1 and 6,187,367 B1
describe emulsified meat analogs and emulsified meat extenders. U.S. Patent
No. 5,206,050 B1 describes soy protein curd useful for cooked food analogs (also
can be used as a process to form a curd useful to make food analogs). U.S. Patent
No. 4,284,656 to Hwa describes a soy protein curd useful for food analogs.
U.S. Patent No. 3,988,485 to Hibbert et al. describes a meat-like protein food
formed from spun vegetable protein fibers. U.S. Patent No. 3,950,564 to Puski et al.
describes a process of making a soy based meat substitute and U.S. Patent
No. 3,925,566 to Reinhart et al. describes a simulated meat product. For example,
soy protein that has been processed to impart a structure, chunk or fiber for use as
a food ingredient is called "textured soy protein" (TSP). TSPs are frequently made
to resemble meat, seafood, or poultry in structure and appearance when hydrated.

There can be mentioned meat analogs, cheese analogs, milk analogs and
the like. 

Meat analogs made from soybeans contain soy protein or tofu and other 
ingredients mixed together to simulate various kinds of meats. These meat 
alternatives are sold as frozen, canned or dried foods. Usually, they can be used
the same way as the foods they replace. Meat alternatives made from soybeans
are excellent sources of protein, iron and B vitamins. Examples of meat analogs
include, but are not limited to, ham analogs, sausage analogs, bacon analogs, and 
the like. 

Food analogs can be classified as imitations or substitutes depending on their
functional and compositional characteristics. For example, an imitation cheese
need only resemble the cheese it is designed to replace. However, a product can
generally be called a substitute cheese only if it is nutritionally equivalent to the
cheese it is replacing and meets the minimum compositional requirements for that
cheese. Thus, substitute cheese will often have higher protein levels than imitation
cheeses and be fortified with vitamins and minerals.

Milk analogs or nondairy food products include, but are not limited to, 
imitation milk, nondairy frozen desserts such as those made from soybeans and/or 
soy protein products. 

Meat products encompass a broad variety of products. In the United States
"meat" includes "red meats" produced from cattle, hogs and sheep. In addition to
the red meats there are poultry items which include chickens, turkeys, geese,
guineas, ducks and the fish and shellfish. There is a wide assortment of seasoned
and processed meat products: fresh, cured and fried, and cured and cooked.
Sausages and hot dogs are examples of processed meat products. Thus, the term
"meat products" as used herein includes, but is not limited to, processed meat
products.

A cereal food product is a food product derived from the processing of a 
cereal grain. A cereal grain includes any plant from the grass family that yields an 
edible grain (seed). The most popular grains are barley, corn, millet, oats, quinoa,
rice, rye, sorghum, triticale, wheat and wild rice. Examples of a cereal food product 
include, but are not limited to, whole grain, crushed grain, grits, flour, bran, germ, 
breakfast cereals, extruded foods, pastas, and the like. 

A baked goods product comprises any of the cereal food products mentioned
above and has been baked or processed in a manner comparable to baking, i.e., to
dry or harden by subjecting to heat. Examples of a baked good product include, but
are not limited to, bread, cakes, doughnuts, bread crumbs, baked snacks, mini-biscuits,
mini-crackers, mini-cookies, and mini-pretzels. As was mentioned above,
oils of the invention can be used as an ingredient.

In general, soybean oil is produced using a series of steps involving the 
extraction and purification of an edible oil product from the oil bearing seed. 
Soybean oils and soybean byproducts are produced using the generalized steps 
shown in the diagram below. 

Process                          Impurities Removed/
                                 Byproducts Obtained

Soybean
   ↓
Oil Extraction
   ↓
Degumming
   ↓
Alkali or Physical Refining
   ↓
Water Washing                    Soap
   ↓
Bleaching
   ↓
(Hydrogenation)
   ↓
(Winterization)                  Stearine
   ↓
Deodorization
   ↓
Oil Products

Soybean seeds are cleaned, tempered, dehulled, and flaked, which increases 
the efficiency of oil extraction. Oil extraction is usually accomplished by solvent 
(hexane) extraction but can also be achieved by a combination of physical pressure 
and/or solvent extraction. The resulting oil is called crude oil. The crude oil may be 
degummed by hydrating phospholipids and other polar and neutral lipid complexes, 
which facilitates their separation from the nonhydrating, triglyceride fraction (soybean 
oil). The resulting lecithin gums may be further processed to make commercially 
important lecithin products used in a variety of food and industrial products as 
emulsification and release (antisticking) agents. Degummed oil may be further 
refined for the removal of impurities, primarily free fatty acids, pigments, and 
residual gums. Refining is accomplished by the addition of a caustic agent that 
reacts with free fatty acid to form soap and hydrates phosphatides and proteins in 
the crude oil. Water is used to wash out traces of soap formed during refining. The 
soapstock byproduct may be used directly in animal feeds or acidulated to recover 
the free fatty acids. Color is removed through adsorption with a bleaching earth that 
removes most of the chlorophyll and carotenoid compounds. The refined oil can be 
hydrogenated, resulting in fats with various melting properties and textures. 
Winterization (fractionation) may be used to remove stearine from the hydrogenated 
oil through crystallization under carefully controlled cooling conditions. 

Deodorization, which is principally steam distillation under vacuum, is the last step 
and is designed to remove compounds which impart odor or flavor to the oil. Other 
valuable byproducts such as tocopherols and sterols may be removed during the 
deodorization process. Deodorized distillate containing these byproducts may be 
sold for production of natural vitamin E and other high-value pharmaceutical 
products. Refined, bleached, (hydrogenated, fractionated) and deodorized oils and 
fats may be packaged and sold directly or further processed into more specialized 
products. A more detailed reference to soybean seed processing, soybean oil 
production and byproduct utilization can be found in Erickson, 1995, Practical 
Handbook of Soybean Processing and Utilization, The American Oil Chemists' 
Society and United Soybean Board. 

Soybean oil is liquid at room temperature because it is relatively low in 
saturated fatty acids when compared with oils such as coconut, palm, palm kernel 
and cocoa butter. Many processed fats, including spreads, confectionary fats, hard 
butters, margarines, baking shortenings, etc., require varying degrees of solidity at 
room temperature and can only be produced from soybean oil through alteration of 
its physical properties. This is most commonly achieved through catalytic 
hydrogenation. 

Hydrogenation is a chemical reaction in which hydrogen is added to the 
unsaturated fatty acid double bonds with the aid of a catalyst such as nickel. High 
oleic soybean oil contains unsaturated oleic, linoleic, and linolenic fatty acids and 
each of these can be hydrogenated. Hydrogenation has two primary effects. First, 
the oxidative stability of the oil is increased as a result of the reduction of the 
unsaturated fatty acid content. Second, the physical properties of the oil are 
changed because the fatty acid modifications increase the melting point, resulting in 
a semi-liquid or solid fat at room temperature. 

There are many variables which affect the hydrogenation reaction, which in 
turn alter the composition of the final product. Operating conditions including 
pressure, temperature, catalyst type and concentration, agitation and reactor design 
are among the more important parameters which can be controlled. Selective 
hydrogenation conditions can be used to hydrogenate the more unsaturated fatty 
acids in preference to the less unsaturated ones. Very light or brush hydrogenation 
is often employed to increase stability of liquid oils. Further hydrogenation converts 
a liquid oil to a physically solid fat. The degree of hydrogenation depends on the 
desired performance and melting characteristics designed for the particular end 
product. Liquid shortenings, used in the manufacture of baking products, solid fats 
and shortenings used for commercial frying and roasting operations, and base 
stocks for margarine manufacture are among the myriad of possible oil and fat 
products achieved through hydrogenation. A more detailed description of 
hydrogenation and hydrogenated products can be found in Patterson, H. B. W., 
1994, Hydrogenation of Fats and Oils: Theory and Practice, The American Oil 
Chemists' Society. 

Hydrogenated oils have also become controversial due to the presence of 
trans fatty acid isomers that result from the hydrogenation process. Ingestion of 
large amounts of trans isomers has been linked with detrimental health effects 
including increased ratios of low density to high density lipoproteins in the blood 
plasma and increased risk of coronary heart disease. 

A snack food product comprises any of the above or below described food 
products. 

A fried food product comprises any of the above or below described food 
products that has been fried. 

A health food product is any food product that imparts a health benefit. Many 
oilseed-derived food products may be considered as health foods. 
The beverage can be in a liquid or in a dry powdered form. 

For example, there can be mentioned non-carbonated drinks; fruit juices, 
fresh, frozen, canned or concentrate; flavored or plain milk drinks, etc. Adult and 
infant nutritional formulas are well known in the art and commercially available (e.g., 
Similac®, Ensure®, Jevity®, and Alimentum® from Ross Products Division, Abbott 
Laboratories). 

Infant formulas are liquids or reconstituted powders fed to infants and young 
children. They serve as substitutes for human milk. Infant formulas have a special 
role to play in the diets of infants because they are often the only source of nutrients 
for infants. Although breast-feeding is still the best nourishment for infants, infant 
formula is a close enough second that babies not only survive but thrive. Infant 
formula is becoming increasingly close to breast milk. 

A dairy product is a product derived from milk. A milk analog or nondairy 
product is derived from a source other than milk, for example, soymilk as was 
discussed above. These products include, but are not limited to, whole milk, skim 
milk, fermented milk products such as yogurt or sour milk, cream, butter, condensed 
milk, dehydrated milk, coffee whitener, coffee creamer, ice cream, cheese, etc. 

A pet food product is a product intended to be fed to a pet such as a dog, cat, 
bird, reptile, fish, rodent and the like. These products can include the cereal and 
health food products above, as well as meat and meat byproducts, soy protein 
products, grass and hay products, including but not limited to alfalfa, timothy, oat or 
brome grass, vegetables and the like. 

Animal feed is a product intended to be fed to animals such as turkeys, 
chickens, cattle and swine and the like. As with the pet foods above, these products 
can include cereal and health food products, soy protein products, meat and meat 
byproducts, and grass and hay products as listed above. 

Aquaculture feed is a product intended to be used in aquafarming, which 
concerns the propagation, cultivation or farming of aquatic organisms, animals 
and/or plants in fresh or marine waters. 

In yet another embodiment, this invention includes an oilseed plant that 
produces mature seeds in which the total seed fatty acid profile comprises 
polyunsaturated fatty acids having at least twenty carbon atoms and five or more 
carbon-carbon double bonds wherein the ratio of EPA:DHA is in the range from 
1:100 to 860:100. The oilseed plant may further have a total seed fatty acid profile 
comprising less than 2.0% arachidonic acid. Also of interest are seeds obtained 
from such plants and oil obtained from the seeds of such plants. 

In still yet another embodiment, this invention includes an oilseed plant that 
produces mature seeds in which the total seed fatty acid profile comprises 
polyunsaturated fatty acids having at least twenty carbon atoms and five or more 
carbon-carbon double bonds wherein the ratio of DHA:EPA is in the range from 
1:100 to 110:100. The oilseed plant may further have a total seed fatty acid profile 
comprising less than 2.0% arachidonic acid. Also of interest are seeds obtained 
from such plants and oil obtained from the seeds of such plants. 

It is reasonable to believe that any integer ratio of EPA:DHA from 1:100 
through 860:100, or DHA:EPA from 1:100 through 110:100, might be obtainable in 
plants described or envisioned within the scope and spirit of the present invention. 
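
As an illustration only, the ratio ranges recited above can be checked numerically. The sketch below expresses a ratio in the "X:100" form used in the text; the function names and the example percentages are hypothetical and are not measurements from this specification.

```python
def ratio_per_100(numerator_pct, denominator_pct):
    """Express numerator:denominator in the 'X:100' form used in the text."""
    if denominator_pct <= 0:
        raise ValueError("denominator percentage must be positive")
    return 100.0 * numerator_pct / denominator_pct


def epa_dha_in_range(epa_pct, dha_pct, low=1.0, high=860.0):
    """True if EPA:DHA lies between 1:100 and 860:100 (inclusive)."""
    return low <= ratio_per_100(epa_pct, dha_pct) <= high


def dha_epa_in_range(epa_pct, dha_pct, low=1.0, high=110.0):
    """True if DHA:EPA lies between 1:100 and 110:100 (inclusive)."""
    return low <= ratio_per_100(dha_pct, epa_pct) <= high


# Hypothetical seed profile (percent of total fatty acids), for illustration only.
profile = {"EPA": 12.0, "DHA": 3.0}
print(ratio_per_100(profile["EPA"], profile["DHA"]))     # EPA:DHA as X:100
print(epa_dha_in_range(profile["EPA"], profile["DHA"]))
print(dha_epa_in_range(profile["EPA"], profile["DHA"]))
```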

EXAMPLES 

The present invention is further defined in the following Examples, in which 
all parts and percentages are given as weight to volume, and degrees are Celsius, 
unless otherwise stated. It should be understood that these Examples, while 
indicating preferred embodiments of the invention, are given by way of illustration 
only. From the above discussion and these Examples, one skilled in the art can 
ascertain the essential characteristics of this invention, and without departing from 
the spirit and scope thereof, can make various changes and modifications of the 
invention to adapt it to various uses and conditions. Thus, various modifications of 
the invention in addition to those shown and described herein will be apparent to 
those skilled in the art from the foregoing description. Such modifications are also 
intended to fall within the scope of the appended claims. 

The disclosures contained within the references used herein are hereby 
incorporated by reference. 
GENERAL MATERIALS AND METHODS 

Procedures for nucleic acid phosphorylation, restriction enzyme digests, 
ligation and transformation are well known in the art. Techniques suitable for use in 
the following examples may be found in Sambrook, J., Fritsch, E. F. and Maniatis, 
T., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY (1989) (hereinafter "Maniatis"). 

Materials and methods suitable for the maintenance and growth of bacterial 
cultures are well known in the art. Techniques suitable for use in the following 
examples may be found as set out in Manual of Methods for General Bacteriology 
(Phillipp Gerhardt, R. G. E. Murray, Ralph N. Costilow, Eugene W. Nester, Willis A. 
Wood, Noel R. Krieg and G. Briggs Phillips, eds), American Society for 
Microbiology, Washington, DC (1994) or by Thomas D. Brock in Biotechnology: A 
Textbook of Industrial Microbiology, Second Edition, Sinauer Associates, Inc., 
Sunderland, MA (1989). All reagents, restriction enzymes and materials used for 
the growth and maintenance of bacterial and plant cells were obtained from Aldrich 
Chemicals (Milwaukee, WI), DIFCO Laboratories (Detroit, MI), GIBCO/BRL 
(Gaithersburg, MD), or Sigma Chemical Company (St. Louis, MO) unless otherwise 
specified. 

The meaning of abbreviations is as follows: "h" or "hr" means hour(s), "min" 
or "min." means minute(s), "sec" or "s" means second(s), "d" or "day" means day(s), 
"mL" means milliliters, "L" means liters. 

Bacterial Strains and Plasmids: 

E. coli TOP10 cells and E. coli electromax DH10B cells were obtained from 
Invitrogen (Carlsbad, CA). Max Efficiency competent cells of E. coli DH5α were 
obtained from GIBCO/BRL (Gaithersburg, MD). Plasmids containing EPA or DHA 
biosynthetic pathway genes were obtained from Ross Products Division, Abbott 
Laboratories, Columbus, OH. The genes and the source plasmids are listed in 
Table 1. 

TABLE 1 
EPA BIOSYNTHETIC PATHWAY GENES 

Gene                  Organism        Plasmid Name   Reference 
Delta-6 desaturase    S. diclina      pRSP1          WO 02/081668 
Delta-6 desaturase    M. alpina       pCGR5          US Patent 5,968,809 
Elongase              M. alpina       pRPB2          WO 00/12720 
Delta-5 desaturase    M. alpina       pCGR4          US Patent 6,075,183 
Delta-5 desaturase    S. diclina      pRSP3          WO 02/081668 
Delta-17 desaturase   S. diclina      pRSP19         Example 6 
Elongase              T. aureum       pRAT-4-A7      WO 02/08401 
Elongase              Pavlova sp.     pRPL-6-B2      Example 13 
Delta-4 desaturase    S. aggregatum   pRSA1          WO 02/090493 


Plasmids pKS102 and pKS121 are described in WO 02/00904. Plasmid 
pKS123 is described in WO 02/08269. Plasmid pCF3 is described in [Yadav, N.S. 
et al. (1993) Plant Physiol. 103:467-76]. Cloning vector pCR-Script AMP SK(+) was 
from Stratagene (La Jolla, CA). Cloning vector pUC19 [Messing, J. (1983) Meth. 
Enzymol. 101:20] was from New England Biolabs (Beverly, MA). Cloning vector 
pGEM-T easy was from Promega (Madison, WI). 
Growth Conditions: 

Bacterial cells were usually grown in Luria-Bertani (LB) medium containing 
1% of bacto-tryptone, 0.5% of bacto-yeast extract and 1% of NaCl. Occasionally, 
bacterial cells were grown in SOC medium containing 2% of bacto-tryptone, 0.5% of 
bacto-yeast extract, 0.5% of NaCl and 20 mM glucose or in Superbroth (SB) 
containing 3.5% of bacto-tryptone, 2% of bacto-yeast extract, 0.05% of NaCl and 
0.005 M NaOH. 
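
Because percentages in these Examples are stated as weight to volume, the media compositions above translate directly into grams per batch (1% corresponds to 1 g per 100 mL). The following is a minimal sketch of that arithmetic; the dictionary layout and function name are illustrative assumptions, and the molar components (glucose in SOC, NaOH in SB) are deliberately left out.

```python
# Percent (w/v) values copied from the Growth Conditions text above.
# Molar components (20 mM glucose in SOC, 0.005 M NaOH in SB) are omitted.
MEDIA_PERCENT_WV = {
    "LB": {"bacto-tryptone": 1.0, "bacto-yeast extract": 0.5, "NaCl": 1.0},
    "SOC": {"bacto-tryptone": 2.0, "bacto-yeast extract": 0.5, "NaCl": 0.5},
    "SB": {"bacto-tryptone": 3.5, "bacto-yeast extract": 2.0, "NaCl": 0.05},
}


def grams_needed(medium, volume_ml):
    """Grams of each dry component for volume_ml of medium (1% w/v = 1 g/100 mL)."""
    recipe = MEDIA_PERCENT_WV[medium]
    return {name: pct * volume_ml / 100.0 for name, pct in recipe.items()}


# Example: a 500 mL batch of LB.
for component, grams in grams_needed("LB", 500).items():
    print(f"{component}: {grams} g")
```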

Antibiotics were often added to liquid or solid media in order to select for 
plasmids or insertions with appropriate antibiotic resistance genes. Kanamycin, 
ampicillin and hygromycin were routinely used at final concentrations of 50 µg/mL 
(Kan50), 100 µg/mL (Amp100) or 50 µg/mL (Hyg50), respectively. 
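
The final concentrations above (read here as µg/mL) can be reached from a concentrated stock by the usual C1·V1 = C2·V2 dilution. The sketch below is illustrative only: the stock concentrations passed in are hypothetical, since the specification does not state how stocks were prepared.

```python
# Final working concentrations from the text, in micrograms per mL.
FINAL_UG_PER_ML = {"Kan50": 50.0, "Amp100": 100.0, "Hyg50": 50.0}


def stock_volume_ul(label, medium_ml, stock_mg_per_ml):
    """Microliters of stock to add to medium_ml of medium (C1*V1 = C2*V2).

    stock_mg_per_ml is an assumption supplied by the user; it is not
    given in the specification. The small volume added is neglected.
    """
    if stock_mg_per_ml <= 0:
        raise ValueError("stock concentration must be positive")
    stock_ug_per_ml = stock_mg_per_ml * 1000.0
    volume_ml = FINAL_UG_PER_ML[label] * medium_ml / stock_ug_per_ml
    return volume_ml * 1000.0


# Example: 500 mL of medium at Kan50 from a hypothetical 50 mg/mL stock.
print(stock_volume_ul("Kan50", 500, 50.0), "uL")
```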

EXAMPLE 1 
Isolation of soybean seed-specific promoters 
The soybean annexin and BD30 promoters were isolated with the Universal 
GenomeWalker system (Clontech) according to its user manual (PT3042-1). To 
make soybean GenomeWalker libraries, samples of soybean genomic DNA were 
digested with DraI, EcoRV, PvuII and StuI separately for two hours. After DNA 
purification, the digested genomic DNAs were ligated to the GenomeWalker 
adaptors AP1 and AP2. 

Two gene specific primers (GSP1 and GSP2) were designed for the soybean 
annexin gene based on the 5' coding sequences in annexin cDNA in the DuPont EST 
database. The sequences of GSP1 and GSP2 are set forth in SEQ ID NOS:1 and 2. 

GCCCCCCATCCTTTGAAAGCCTGT SEQ ID NO:1 

CGCGGATCCGAGAGCCTCAGCATCTTGAGCAGAA SEQ ID NO:2 


The AP1 and the GSP1 primers were used in the first round PCR using the 
conditions defined in the GenomeWalker system protocol. Cycle conditions were 
94°C for 4 minutes; 94°C for 2 seconds and 72°C for 3 minutes, 7 cycles; 94°C for 
2 seconds and 67°C for 3 minutes, 32 cycles; 67°C for 4 minutes. The products from 
the first run PCR were diluted 50-fold. One microliter of the diluted products was 
used as template for the second PCR with AP2 and GSP2 as primers. Cycle 
conditions were 94°C for 4 minutes; 94°C for 2 seconds and 72°C for 3 minutes, 
5 cycles; 94°C for 2 seconds and 67°C for 3 minutes, 20 cycles; 67°C for 3 minutes. 
A 2.1 kb genomic fragment was amplified and isolated from the EcoRV-digested 
GenomeWalker library. The genomic fragment was digested with BamHI and SalI 
and cloned into Bluescript KS+ vector for sequencing. The DNA sequence of this 
2012 bp soybean annexin promoter fragment is set forth in SEQ ID NO:3. 
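
For planning purposes, the two annexin cycling programs above can be written as data and their programmed hold times summed. This is a sketch under stated assumptions: the stage layout and function name are illustrative, thermal ramp times are ignored, and the garbled temperature in the second-round 20-cycle stage is read as 67°C by analogy with the first round.

```python
# Each stage: number of cycles and a list of (temperature in deg C, seconds).
FIRST_ROUND = [
    {"cycles": 1, "steps": [(94, 240)]},
    {"cycles": 7, "steps": [(94, 2), (72, 180)]},
    {"cycles": 32, "steps": [(94, 2), (67, 180)]},
    {"cycles": 1, "steps": [(67, 240)]},
]

SECOND_ROUND = [
    {"cycles": 1, "steps": [(94, 240)]},
    {"cycles": 5, "steps": [(94, 2), (72, 180)]},
    {"cycles": 20, "steps": [(94, 2), (67, 180)]},  # 67 deg C is an assumed reading
    {"cycles": 1, "steps": [(67, 180)]},
]


def hold_seconds(program):
    """Sum of programmed hold times; ramping between temperatures is ignored."""
    total = 0
    for stage in program:
        per_cycle = sum(seconds for _temp_c, seconds in stage["steps"])
        total += stage["cycles"] * per_cycle
    return total


for name, program in (("first round", FIRST_ROUND), ("second round", SECOND_ROUND)):
    seconds = hold_seconds(program)
    print(f"{name}: {seconds} s ({seconds / 60:.1f} min)")
```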

Two gene specific primers (GSP3 and GSP4) were designed for soybean 
BD30 based on the 5' coding sequences in BD30 cDNA in NCBI GenBank 
(J05560). The oligonucleotide sequences of the GSP3 and GSP4 primers are 
set forth in SEQ ID NOS:4 and 5. 

GGTCCAATATGGAACGATGAGTTGATA SEQ ID NO:4 

CGCGGATCCGCTGGAACTAGAAGAGAGACCTAAGA SEQ ID NO:5 

The AP1 and the GSP3 primers were used in the first round PCR using the 
same conditions defined in the GenomeWalker system protocol. The cycle 
conditions used for the soybean annexin promoter did not work well for the soybean 
BD30 promoter in the GenomeWalker experiment. A modified touchdown PCR protocol 
was used. Cycle conditions were: 94°C for 4 minutes; 94°C for 2 seconds and 74°C 
for 3 minutes, 6 cycles in which annealing temperature drops 1°C every cycle; 94°C 
for 2 seconds and 69°C for 3 minutes, 32 cycles; 69°C for 4 minutes. The products 
from the first run PCR were diluted 50-fold. One microliter of the diluted products 
was used as template for the second PCR with AP2 and GSP4 as primers. Cycle 
conditions were: 94°C for 4 minutes; 94°C for 2 seconds and 74°C for 3 minutes, 6 cycles 
in which annealing temperature drops 1°C every cycle; 94°C for 2 seconds and 69°C 
for 3 minutes, 20 cycles; 69°C for 3 minutes. A 1.5 kb genomic fragment was 
amplified and isolated from the PvuII-digested GenomeWalker library. The genomic 
fragment was digested with BamHI and SalI and cloned into Bluescript KS+ vector 
for sequencing. DNA sequencing determined that this genomic fragment contained 
a 1408 bp soybean BD30 promoter sequence (SEQ ID NO:6). 
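
The touchdown portion of the BD30 protocol can be made explicit by listing the annealing/extension temperature used in each cycle. The sketch below is illustrative; the function and parameter names are assumptions, and the numbers are taken from the first-round conditions described above.

```python
def touchdown_schedule(start_c, drop_per_cycle, touchdown_cycles,
                       plateau_c, plateau_cycles):
    """Per-cycle annealing/extension temperatures for a touchdown program.

    The first cycle runs at start_c, each later touchdown cycle is
    drop_per_cycle lower, then plateau_cycles run at plateau_c.
    """
    temps = [start_c - drop_per_cycle * i for i in range(touchdown_cycles)]
    temps += [plateau_c] * plateau_cycles
    return temps


# First-round BD30 program: 6 touchdown cycles from 74 deg C, then 32 cycles at 69 deg C.
first_round = touchdown_schedule(74, 1, 6, 69, 32)
print(first_round[:6])   # touchdown phase
print(len(first_round))  # total number of cycles
```

Under this reading the sixth touchdown cycle already runs at the 69°C plateau temperature, so the program steps smoothly into the fixed-temperature cycles.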

Based on the soybean β-conglycinin β-subunit promoter sequence in the 
NCBI database (S44893), two oligos with either BamHI or NotI sites at 
the 5' ends were designed to amplify the soybean β-conglycinin β-subunit promoter 
(SEQ ID NO:7). The oligonucleotide sequences of these two oligos are set forth in 
SEQ ID NOS:8 and 9. 

CGCGGATCCTATATATGTGAGGGTAGAGGGTATCAC SEQ ID NO:8 

GAATTCGCGGCCGCAGTATATATATTATTGGACGATGAAACATG SEQ ID NO:9 

Based on the soybean Glycinin Gy1 promoter sequence in 
the NCBI GenBank database (X15121), two oligos with either BamHI or NotI sites at 
the 5' ends were designed to amplify the soybean Glycinin Gy1 promoter (SEQ ID 
NO:10). The oligonucleotide sequences of these two oligos are set forth in SEQ ID 
NOS:11 and 12. 

CGCGGATCCTAGCCTAAGTACGTACTCAAAATGCCA SEQ ID NO:11 

GAATTCGCGGCCGCGGTGATGAGTGATGAGTGTTTAAGGAC SEQ ID NO:12 

EXAMPLE 2 

Vector Construction for Characterizing Strong, Seed-specific Promoters 
EPA can be produced at high levels in the seeds of important oil crops, such 
as soy, by strongly expressing each of the individual biosynthetic genes together, in 
a seed specific manner. To reduce the chance of co-suppression, each individual 
gene can be operably linked to a different, strong, seed-specific promoter. Because 
the biosynthetic pathway leading to EPA involves the concerted action of a large 
number of different genes, it was necessary to first identify and characterize many 
different promoters that could then be used to express each EPA biosynthetic gene. 
Promoters were identified and tested for their relative seed-specific strengths by 
linking them to the M. alpina delta-6 desaturase which, in these experiments, acted 
as a reporter gene. The M. alpina delta-6 desaturase can introduce a double bond 
between the C6 and C7 carbon atoms of linoleic acid (LA) and α-linolenic acid 
(ALA) to form γ-linolenic acid (GLA) and stearidonic acid (STA), respectively. 
Because GLA and STA are not normally found in the lipids of soybean, their 
presence and concentration in soy was indicative of the relative strength of the 
promoter behind which the delta-6 desaturase had been placed. Promoters tested in 
this way are listed in Table 2 and the plasmid construction for each is described 
below. 

TABLE 2 
SEED-SPECIFIC PROMOTERS AND VECTORS 

Promoter                   Organism   Vector Name   Promoter Reference 
β-conglycinin α'-subunit   Soy        pKR162        Beachy et al., (1985) EMBO J. 4:3047-3053 
Kunitz Trypsin Inhibitor   Soy        pKR124        Jofuku et al., (1989) Plant Cell 1:1079-1093 
annexin                    Soy        pJS92         this report (1) 
Glycinin Gy1               Soy        pZBL119       this report 
Albumin 2S                 Soy        pKR188        US Patent 6,177,613 
Legumin A1                 Pea        pKR189        Rerie et al. (1991) Mol. Gen. Genet. 225:148-157 
β-conglycinin β-subunit    Soy        pZBL118       this report 
BD30 (also called P34)     Soy        pJS93         this report (1) 
Legumin A2                 Pea        pKR187        Rerie et al. (1991) Mol. Gen. Genet. 225:148-157 

(1) This also constitutes the subject matter of Applicant's Assignee's application having Application 
No. 60/446,833 (Attorney Docket No. BB1531PRV) filed concurrently herewith. 
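
The reporter-gene logic described before Table 2 (GLA and STA are absent from normal soybean lipids, so their combined level reflects delta-6 desaturase expression) can be sketched as a simple ranking. Everything in this block is illustrative: the function names are assumptions, the promoter labels are placeholders, and the percentages are made-up numbers, not results from this specification.

```python
def delta6_products(profile):
    """Combined GLA + STA level (percent of total fatty acids)."""
    return profile.get("GLA", 0.0) + profile.get("STA", 0.0)


def rank_promoters(profiles):
    """Promoter names ordered from highest to lowest GLA + STA."""
    return sorted(profiles, key=lambda name: delta6_products(profiles[name]),
                  reverse=True)


# Placeholder values for illustration only; NOT experimental data.
example_profiles = {
    "promoter_A": {"GLA": 10.0, "STA": 2.0},
    "promoter_B": {"GLA": 25.0, "STA": 5.0},
    "promoter_C": {"GLA": 0.0, "STA": 0.0},  # no detectable reporter activity
}

print(rank_promoters(example_profiles))
```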



The gene for the M. alpina delta-6 desaturase was PCR-amplified from 
pCGR5 using primers oCGR5-1 (SEQ ID NO:13) and oCGR5-2 (SEQ ID NO:14), 
which were designed to introduce NotI restriction enzyme sites at both ends of the 
delta-6 desaturase and an NcoI site at the start codon of the reading frame for the 
enzyme. 

TTGCGGCCGCAAACCATGGCTGCTGCTCCCAG (SEQ ID NO:13) 

AAGCGGCGGCTTACTGCGCCTTAC (SEQ ID NO:14) 

The resulting PCR fragment was subcloned into the intermediate cloning vector 
pCR-Script AMP SK(+) (Stratagene) according to the manufacturer's protocol to give 
plasmid pKR159. Plasmid pKR159 was then digested with NotI to release the 
M. alpina delta-6 desaturase, which was, in turn, cloned into the NotI site of a 
selected soybean expression vector. Each expression vector tested contained a 
NotI site flanked by a suitable promoter and transcription terminator. Each vector 
also contained the hygromycin B phosphotransferase gene [Gritz, L. and Davies, J. 
(1983) Gene 25:179-188], flanked by the T7 promoter and transcription terminator 
(T7prom/hpt/T7term cassette), and a bacterial origin of replication (ori) for selection 
and replication in E. coli. In addition, each vector also contained the hygromycin B 
phosphotransferase gene, flanked by the 35S promoter [Odell et al., (1985) Nature 
313:810-812] and NOS 3' transcription terminator [Depicker et al., (1982) J. Mol. 
Appl. Genet. 1:561-570] (35S/hpt/NOS3' cassette) for selection in soybean. 

Vector pKR162 was constructed by cloning the NotI fragment of pKR159, 
containing the delta-6 desaturase, into the NotI site of vector pKS123. Vector pKS123 
contains a NotI site flanked by the promoter for the α' subunit of β-conglycinin and 
the phaseolin 3' transcription terminator elements (βcon/NotI/Phas3' cassette). 

Vector pKR188 was constructed by cloning the NotI fragment of pKR159, 
containing the delta-6 desaturase, into the NotI site of vector pKR135. Vector 
pKR135 contains a NotI site flanked by the 2S albumin promoter and the 2S 
albumin 3' transcription terminator elements (SA/NotI/SA3' cassette). Plasmid 
pKR135 was constructed by cloning the BamHI/SalI fragment of pKR132, 
containing the SA/NotI/SA3' cassette, into the BamHI/SalI site of pKS120. Plasmid 
pKS120 is identical to pKS123 except the HindIII fragment containing the 
βcon/NotI/Phas3' cassette was removed. Plasmid pKR132, containing the 
SA/NotI/SA3' cassette flanked by BamHI and SalI sites, was constructed by cloning 
the XbaI fragment of the SA/NotI/SA3' cassette, made by PCR amplification, into 
the XbaI site of pUC19. The albumin promoter was amplified from plasmid AL3 
promoter::pBI121 (US Patent 6,177,613) using PCR. Primer oSAIb-9 (SEQ ID 
NO:15) was designed to introduce an XbaI site at the 5' end of the promoter, and 
oSAIb-3 (SEQ ID NO:16) was designed to introduce a NotI site at the 3' end of the 
promoter. 

ATCTAGACCTGCAGGCCAACTGCGTTTGGGGCTC (SEQ ID NO:15) 

CTTTTAACTTCGCGGCCGCTTGCTATTGATGGGTGAAGTG (SEQ ID NO:16) 

The albumin transcription terminator was amplified from soy genomic DNA using 
primer oSAIb-4 (SEQ ID NO:17), designed to introduce a NotI site at the 5' end of 
the terminator, and primer oSAIb-2 (SEQ ID NO:18), designed to introduce BsiWI 
and XbaI sites at the 3' end of the terminator. 

CAATAGCAAGCGGCCGGGAAGTTAAAAGCAATGTTGTC (SEQ ID NO:17) 

AATGTAGACGTACGCAAAGGGAAAGATTTAAAGTC (SEQ ID NO:18) 

The resulting PCR fragments were then combined and re-amplified using primers 
oSAIb-9 and oSAIb-2, thus forming the SA/NotI/SA3' cassette, which was 
subsequently cloned into pUC19 to give pKR132. 

Vector pKR187 was constructed by cloning the NotI fragment of pKR159, 
containing the delta-6 desaturase, into the NotI site of vector pKR145. Vector 
pKR145 contains a NotI site flanked by the pea leguminA2 promoter and the pea 
leguminA2 3' transcription terminator (legA2/NotI/legA23' cassette). Plasmid 
pKR145 was constructed by cloning the BamHI/SalI fragment of pKR142, 
containing the legA2/NotI/legA23' cassette, into the BamHI/SalI fragment of pKS120 
(described above). The legA2/NotI/legA23' cassette of pKR142 was flanked by 
BsiWI sites and contained a PstI site at the extreme 5' end of the legA2 promoter. In 
addition, this cassette was flanked by BamHI and SalI sites. Plasmid pKR142 was 
constructed by cloning the BsiWI fragment of pKR140, containing the 
legA2/NotI/legA23' cassette, into the BsiWI site of pKR124, containing a bacterial 
ori and ampicillin resistance gene. This cloning step introduced the SalI site and 
allowed further subcloning into pKS124. The legA2/NotI/legA23' cassette of 
pKR140 was made by PCR amplification from pea genomic DNA. The legA2 
promoter was amplified from pea genomic DNA using primer LegPro5' (SEQ ID 
NO:19), designed to introduce XbaI and BsiWI sites at the 5' end of the promoter, 
and primer LegPro3' (SEQ ID NO:20), designed to introduce a NotI site at the 3' 
end of the promoter. 

TTTCTAGACGTACGTCCCTTCTTATCTTTGATCTCC (SEQ ID NO:19) 

GCGGCCGCAGTTGGATAGAATATATGTTTGTGAC (SEQ ID NO:20) 

The legA2 transcription terminator was amplified from pea genomic DNA using 
primer LegTerm5' (SEQ ID NO:21), designed to introduce a NotI site at the 5' end of 
the terminator, and primer LegTerm3' (SEQ ID NO:22), designed to introduce BsiWI 
and XbaI sites at the 3' end of the terminator. 

CTATCCAACTGCGGCCGCATTTCGCACCAAATCAATGAAAG (SEQ ID NO:21) 

AATCTAGACGTACGTGAAGGTTAAACATGGTGAATATG (SEQ ID NO:22) 

The resulting PCR fragments were then combined and re-amplified using primers 
LegPro5' and LegTerm3', thus forming the legA2/NotI/legA23' cassette. The 
legA2/NotI/legA23' cassette PCR fragment was subcloned into the intermediate 
cloning vector pCR-Script AMP SK(+) (Stratagene) according to the manufacturer's 
protocol to give plasmid pKR140. Plasmid pKR124 contains a NotI site flanked by 
the KTi promoter and the KTi transcription termination region (KTi/NotI/KTi3' 
cassette). In addition, the KTi/NotI/KTi3' cassette was flanked by BsiWI sites. The 
KTi/NotI/KTi3' cassette was PCR-amplified from pKS126 using primers oKTi5 (SEQ 
ID NO:23) and oKTi6 (SEQ ID NO:24), designed to introduce XbaI and BsiWI 
sites at both ends of the cassette. 

ATCTAGACGTACGTCCTCGAAGAGAAGGG (SEQ ID NO:23) 

TTCTAGACGTACGGATATAATG (SEQ ID NO:24) 

The resulting PCR fragment was subcloned into the XbaI site of the cloning vector 
pUC19 to give plasmid pKR124. Plasmid pKS126 is similar to pKS121 
(WO 02/00904), the former possessing a second hygromycin phosphotransferase 
gene that is operably linked to a 35S-CaMV promoter. 

Vector pKR189 was constructed by cloning the NotI fragment of pKR159, 
containing the delta-6 desaturase, into the NotI site of vector pKR154. Vector 
pKR154 contains a NotI site flanked by the pea leguminA1 promoter and the pea 
leguminA2 3' transcription terminator (legA1/NotI/legA23' cassette). Vector pKR154 
was made by cloning the HindIII/NotI fragment of pKR151, containing the legA1 
promoter, into the HindIII/NotI fragment of pKR150. Plasmid pKR151 contained a 
NotI site flanked by the leguminA1 promoter and the leguminA1 3' transcription 
terminator (legA1/NotI/legA13' cassette). In addition, the legA1/NotI/legA13' 
cassette was flanked by BsiWI sites. The legA1/NotI/legA13' cassette was made by 
PCR amplification from pea genomic DNA. The legA1 promoter was PCR-amplified 
using primer LegA1Pro5' (SEQ ID NO:25), designed to introduce XbaI and BsiWI 
sites at the 5' end of the promoter, and primer LegA1Pro3' (SEQ ID NO:26), 
designed to introduce a NotI site at the 3' end of the promoter. 

TTTCTAGACGTACGGTCTCAATAGATTAAGAAGTTG (SEQ ID NO:25) 

GCGGCCGCGAAGAGAGATACTAAGAGAATGTTG (SEQ ID NO:26) 

The legA1 transcription terminator was amplified from pea genomic DNA using 
primer LegA1Term5' (SEQ ID NO:27), which was designed to introduce a NotI site at 
the 5' end of the terminator, and primer LegA1Term3' (SEQ ID NO:28), which was 
designed to introduce BsiWI and XbaI sites at the 3' end of the terminator. 

GTATCTCTCTTCGCGGCCGCATTTGGCACCAAATCAATG (SEQ ID NO:27) 

TTTCTAGACGTACGTCAAAAAATTTCATTGTAACTC (SEQ ID NO:28) 

The resulting PCR fragments were then combined and re-amplified using primers 
LegA1Pro5' and LegA1Term3', thus forming the legA1/NotI/legA13' cassette. The 
legA1/NotI/legA13' cassette PCR fragment was subcloned into the intermediate 
cloning vector pCR-Script AMP SK(+) (Stratagene) according to the manufacturer's 
protocol to give plasmid pPL1A. The legA1/NotI/legA13' cassette was subsequently 
excised from pPL1A by digestion with BsiWI and cloned into the BsiWI site of 
pKR145 (described above) to give pKR151. Plasmid pKR150 was constructed by 
cloning the BamHI/HindIII fragment of pKR142 (described above), containing the 
legA2/NotI/legA23' cassette, into the BamHI/HindIII site of pKS120 (described above). 

The amplified soybean β-conglycinin β-subunit promoter fragment (as 
described in Example 1) was digested with BamHI and NotI, purified and cloned 
into the BamHI and NotI sites of plasmid pZBL115 to make pZBL116. The 
pZBL115 plasmid contains the origin of replication from pBR322, the bacterial HPT 
hygromycin resistance gene driven by T7 promoter and T7 terminator, and a 35S 
promoter-HPT-Nos3' gene to serve as a hygromycin resistant plant selection 
marker. The NotI fragment of pKR159, containing the M. alpina delta-6 desaturase 
gene, was cloned into the NotI site of pZBL116 in the sense orientation to make plant 
expression cassette pZBL118. 

The amplified soybean glycinin Gy1 promoter fragment (described in 
Example 1) was digested with BamHI and NotI, purified and cloned into the BamHI 
and NotI sites of plasmid pZBL115 to make pZBL117. The NotI fragment of 
pKR159, containing the M. alpina delta-6 desaturase gene, was cloned into the NotI site 
of pZBL117 in the sense orientation to make plant expression cassette pZBL119. 

Based on the sequence of the soybean annexin promoter (SEQ ID NO:3), as 
described in Example 1, two oligos with either BamHI or NotI sites at the 5' ends 
were designed to re-amplify the promoter. The oligonucleotide sequences of these 
two oligos are shown in SEQ ID NO:29 and SEQ ID NO:30. 

CGCGGATCCATCTTAGGCCCTTGATTATATGGTGTTT (SEQ ID NO:29) 

GAATTCGCGGCCGCTGAAGTATTGCTTCTTAGTTAACCTTTCC (SEQ ID NO:30) 

Based on the sequence of the cloned soybean BD30 promoter (SEQ ID NO:6),
as described in Example 1, two oligos with either BamHI or NotI sites at the 5'
ends were designed to re-amplify the BD30 promoter. The oligonucleotide
sequences of these two oligos are shown in SEQ ID NO:31 and SEQ ID NO:32.

CGCGGATCCAACTAAAAAAAGCTCTCAAATTACATTTTGAG (SEQ ID NO:31)

GAATTCGCGGCCGCAACTTGGTGGAAGAATTTTATGATTTGAAA (SEQ ID NO:32)
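The placement of the BamHI and NotI sites at the 5' ends of these four oligos can be checked with a short script. This is a minimal sketch, not part of the original disclosure: the function name `find_sites` is hypothetical, the primer sequences are copied from the listing above, and the recognition sequences (GGATCC for BamHI, GCGGCCGC for NotI) are the standard ones for these enzymes.

```python
# Standard recognition sequences for the two enzymes named in the text.
SITES = {"BamHI": "GGATCC", "NotI": "GCGGCCGC"}

# Primer sequences as listed in the text (SEQ ID NO:29-32).
PRIMERS = {
    "SEQ ID NO:29": "CGCGGATCCATCTTAGGCCCTTGATTATATGGTGTTT",
    "SEQ ID NO:30": "GAATTCGCGGCCGCTGAAGTATTGCTTCTTAGTTAACCTTTCC",
    "SEQ ID NO:31": "CGCGGATCCAACTAAAAAAAGCTCTCAAATTACATTTTGAG",
    "SEQ ID NO:32": "GAATTCGCGGCCGCAACTTGGTGGAAGAATTTTATGATTTGAAA",
}


def find_sites(seq):
    """Return {enzyme: 0-based position of first match} for sites present in seq."""
    seq = seq.upper()
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, find_sites(seq))
```

In each pair, one primer carries only the BamHI site and the other only the NotI site, which is what allows the re-amplified promoter to be cloned directionally into the BamHI and NotI sites of the vector.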

The re-amplified annexin and BD30 promoter fragments were digested with
BamHI and NotI, purified and cloned into the BamHI and NotI sites of plasmid
pZBL115 to make pJS88 and pJS89, respectively. The pZBL115 plasmid contains
the origin of replication from pBR322, the bacterial HPT hygromycin resistance
gene driven by the T7 promoter and T7 terminator, and a 35S promoter-HPT-Nos3'
gene to serve as a hygromycin-resistant plant selection marker. The M. alpina
delta-6 desaturase gene was cloned into the NotI site of pJS88 and pJS89, in the sense
orientation, to make plant expression cassettes pJS92 and pJS93, respectively.

EXAMPLE 3

Cloning of Individual EPA Biosynthetic Pathway Genes for Expression in Somatic
Soybean Embryos

Each of the EPA biosynthetic genes was tested individually in order to
assess its activity in somatic soybean embryos before the genes were combined for
large-scale production transformation into soybean. Each gene was cloned into an
appropriate expression cassette as described below. For the M. alpina delta-5
desaturase and elongase, both genes were combined together on one plasmid. The
genes and promoters used, and the corresponding vector names, are listed in Table 3.

TABLE 3

EPA BIOSYNTHETIC GENES EXPRESSED IN SOYBEAN SOMATIC EMBRYOS

Activity              Source Organism   Sequence (DNA)   Sequence (Protein)   Vector
Delta-6 desaturase    M. alpina         SEQ ID NO:33     SEQ ID NO:34         pKR162
Delta-6 desaturase    S. diclina        SEQ ID NO:35     SEQ ID NO:36         pKS208
Delta-5 desaturase    S. diclina        SEQ ID NO:37     SEQ ID NO:38         pKR305
Elongase              T. aureum         SEQ ID NO:39     SEQ ID NO:40         pKS209
Delta-17 desaturase   S. diclina        SEQ ID NO:41     SEQ ID NO:42         pKS203
Elongase              M. alpina         SEQ ID NO:43     SEQ ID NO:44         pKS134
Delta-5 desaturase    M. alpina         SEQ ID NO:45     SEQ ID NO:46         pKS134

Construction of pKR162, for soy expression studies with the M. alpina delta-6
desaturase, was described in Example 2.

The S. diclina delta-6 desaturase was cloned into the NotI site of the
βcon/NotI/Phas3' cassette of vector pKS123. The gene for the S. diclina delta-6
desaturase was removed from pRSP1 by digestion with EcoRI and HindIII. The
ends of the resulting DNA fragment were filled and the fragment was cloned into the
filled NotI site of pKS123 to give pKS208.

To release the S. diclina delta-5 desaturase from plasmid pRSP3, it was first
digested with XhoI, the XhoI ends were filled, and the plasmid was then digested
with EcoRI. The delta-5 desaturase-containing fragment was then cloned into
pKR288 that had been digested with MfeI and EcoRV to give pKR305. Plasmid
pKR288 was identical to pKS123 except that a linker containing the MfeI (on the
promoter side) and EcoRV (on the 3' terminal side) sites had been inserted into the
NotI site of the βcon/NotI/Phas3' cassette. This allowed for directional cloning of
the delta-5 desaturase, which contained internal NotI sites, into pKS123.
Construction of pKR288 is more thoroughly described in Example 13.

The T. aureum elongase was cloned into the NotI site of the
βcon/NotI/Phas3' cassette of vector pKS123. The gene for the T. aureum elongase
was removed from pRAT-4-A7 by digestion with EcoRI. The ends of the resulting
DNA fragment were filled and the fragment was cloned into the filled NotI site of
pKS123 to give pKS209.

The gene for the S. diclina delta-17 desaturase (Example 6) was amplified
from pRSP19 using primers RSP19forward (SEQ ID NO:53) and RSP19reverse
(SEQ ID NO:54), which were designed to introduce NotI restriction enzyme sites at
both ends of the delta-17 desaturase.

GCGGCCGCATGACTGAGGATAAGACGA (SEQ ID NO:53)

GCGGCCGCTTAGTCCGACTTGGCCTTG (SEQ ID NO:54)

The resulting PCR fragment was subcloned into the intermediate cloning vector
pGEM-T easy (Promega) according to the manufacturer's protocol to give plasmid
pRSP19/pGEM. The gene for the S. diclina delta-17 desaturase was released from
pRSP19/pGEM by partial digestion with NotI and cloned into the NotI site of
pKS123 to give pKS203.

In plasmid pKS134, both the M. alpina elongase and M. alpina delta-5
desaturase were cloned behind the β-conglycinin promoter followed by the
phaseolin 3' transcription terminator (βcon/Maelo/Phas3' cassette,
βcon/Mad5/Phas3' cassette). Plasmid pKS134 was constructed by cloning the
HindIII fragment of pKS129, containing the βcon/Mad5/Phas3' cassette, into a
HindIII site of partially digested pKS128, containing the βcon/Maelo/Phas3'
cassette, the T7prom/hpt/T7term cassette and the bacterial ori region. The gene for
the M. alpina elongase was amplified from pRPB2 using primers RPB2foward (SEQ
ID NO:55) and RPB2reverse (SEQ ID NO:56), which were designed to introduce
NotI restriction enzyme sites at both ends of the elongase.

GCGGCCGCATGGAGTCGATTGCGC (SEQ ID NO:55)

GCGGCCGCTTACTGCAACTTCCTT (SEQ ID NO:56)

The resulting PCR fragment was digested with NotI and cloned into the NotI site of
pKS119, containing a βcon/NotI/Phas3' cassette, the T7prom/hpt/T7term cassette
and the bacterial ori region, to give pKS128. Plasmid pKS119 is identical to
pKS123, except that the 35S/HPT/NOS3' cassette had been removed. The gene
for the M. alpina delta-5 desaturase was amplified from pCGR4 using primers
CGR4foward (SEQ ID NO:57) and CGR4reverse (SEQ ID NO:58), which were
designed to introduce NotI restriction enzyme sites at both ends of the desaturase.

GCGGCCGCATGGGAACGGACCAAG (SEQ ID NO:57)

GCGGCCGCCTACTCTTCCTTGGGA (SEQ ID NO:58)

The resulting PCR fragment was digested with NotI and cloned into the NotI site of
pKS119, containing a βcon/NotI/Phas3' cassette flanked by HindIII sites, to give
pKS129.
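The three primer pairs in this example follow one pattern: a NotI site placed directly against the start codon on the forward primer and directly against the reverse complement of the stop codon on the reverse primer. The sketch below checks that pattern. It is an illustration only; `reverse_complement` and `check_pair` are hypothetical helper names, and the sequences are copied from SEQ ID NO:53 through SEQ ID NO:58 as listed above.

```python
NOTI = "GCGGCCGC"
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# (forward, reverse) primer pairs as listed in the text.
PAIRS = {
    "S. diclina delta-17 desaturase": ("GCGGCCGCATGACTGAGGATAAGACGA",
                                       "GCGGCCGCTTAGTCCGACTTGGCCTTG"),
    "M. alpina elongase": ("GCGGCCGCATGGAGTCGATTGCGC",
                           "GCGGCCGCTTACTGCAACTTCCTT"),
    "M. alpina delta-5 desaturase": ("GCGGCCGCATGGGAACGGACCAAG",
                                     "GCGGCCGCCTACTCTTCCTTGGGA"),
}


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def check_pair(forward, reverse):
    """Return (forward_ok, stop_codon).

    forward_ok is True when the forward primer is NotI followed by ATG.
    stop_codon is the sense-strand codon adjacent to the NotI site on the
    reverse primer, or None if that codon is not a stop codon.
    """
    forward_ok = forward.upper().startswith(NOTI + "ATG")
    codon = reverse_complement(reverse[len(NOTI):len(NOTI) + 3])
    stop = codon if reverse.upper().startswith(NOTI) and codon in STOP_CODONS else None
    return forward_ok, stop


if __name__ == "__main__":
    for gene, (fwd, rev) in PAIRS.items():
        print(gene, check_pair(fwd, rev))
```

Because both primers of a pair carry the same site, the amplified gene can insert into a NotI-cut cassette in either direction, so a recovered clone has to be screened for the sense orientation.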

EXAMPLE 4 

Assembling EPA Biosynthetic Pathway Genes for Expression in Somatic Soybean
Embryos and Soybean Seeds (pKR274)

The M. alpina delta-6 desaturase, M. alpina elongase and M. alpina delta-5
desaturase were cloned into plasmid pKR274 (Figure 3) behind strong, seed-specific
promoters allowing for high expression of these genes in somatic soybean
embryos and soybean seeds. The delta-6 desaturase was cloned behind the
promoter for the α' subunit of β-conglycinin [Beachy et al., (1985) EMBO J.
4:3047-3053] followed by the 3' transcription termination region of the phaseolin
gene [Doyle, J.J. et al. (1986) J. Biol. Chem. 261:9228-9238] (βcon/Mad6/Phas3'
cassette). The delta-5 desaturase was cloned behind the Kunitz soybean Trypsin
Inhibitor (KTi) promoter [Jofuku et al., (1989) Plant Cell 1:1079-1093], followed by
the KTi 3' termination region, the isolation of which is described in US Patent
6,372,965 (KTi/Mad5/KTi3' cassette). The elongase was cloned behind the glycinin
Gy1 promoter followed by the pea leguminA2 3' termination region
(Gy1/Maelo/legA2 cassette). All of these promoters exhibit strong tissue-specific
expression in the seeds of soybean. Plasmid pKR274 also contains the hygromycin
B phosphotransferase gene [Gritz, L. and Davies, J. (1983) Gene 25:179-188]
cloned behind the T7 RNA polymerase promoter and followed by the T7 terminator
(T7prom/HPT/T7term cassette) for selection of the plasmid on hygromycin B in
certain strains of E. coli, such as NovaBlue(DE3) (Novagen, Madison, WI), which is
lysogenic for lambda DE3 (and carries the T7 RNA polymerase gene under lacUV5
control). In addition, plasmid pKR274 contains a bacterial origin of replication (ori)
functional in E. coli from the vector pSP72 (Stratagene).

Plasmid pKR274 was constructed in many steps from a number of different
intermediate cloning vectors. The Gy1/Maelo/legA2 cassette was released from
plasmid pKR270 by digestion with BsiWI and SbfI and was cloned into the
BsiWI/SbfI sites of plasmid pKR269, containing the delta-6 desaturase, the
T7prom/hpt/T7term cassette and the bacterial ori region. This was designated as
plasmid pKR272. The KTi/Mad5/KTi3' cassette, released from pKR136 by digestion
with BsiWI, was then cloned into the BsiWI site of pKR272 to give pKR274. A
description of plasmid construction for pKR269, pKR270 and pKR136 is provided
below.

Plasmid pKR159 (described in Example 2) was digested with NotI to release
the M. alpina delta-6 desaturase, which was, in turn, cloned into the NotI site of the
soybean expression vector pKR197 to give pKR269. Vector pKR197 contains a
βcon/NotI/Phas3' cassette, the T7prom/hpt/T7term cassette and the bacterial ori
region. Vector pKR197 was constructed by combining the AscI fragment from
plasmid pKS102 (WO 02/00905), containing the T7prom/hpt/T7term cassette and
bacterial ori, with the AscI fragment of plasmid pKR72, containing the
βcon/NotI/Phas cassette. Vector pKR72 is identical to the previously described
vector pKS123 (WO 02/08269), except that SbfI, FseI and BsiWI restriction enzyme
sites were introduced between the HindIII and BamHI sites in front of the
β-conglycinin promoter.

The gene for the M. alpina elongase was PCR-amplified (described in
Example 3), digested with NotI and cloned into the NotI site of vector pKR263 to give
pKR270. Vector pKR263 contains a NotI site flanked by the promoter for the
glycinin Gy1 gene and the leguminA2 3' transcription termination region
(Gy1/NotI/legA2 cassette). In addition, the Gy1/NotI/legA2 cassette was flanked by
SbfI and BsiWI sites. Vector pKR263 was constructed by combining the PstI/NotI
fragment from plasmid pKR142, containing the leguminA2 3' transcription
termination region, an ampicillin resistance gene and bacterial ori, with the PstI/NotI
fragment of plasmid pSGly12, containing the glycinin Gy1 promoter. The
glycinin Gy1 promoter was amplified from pZBL119 (described in Example 2) using
primer oSGly-1 (SEQ ID NO:59), designed to introduce an SbfI/PstI site at the 5'
end of the promoter, and primer oSGly-2 (SEQ ID NO:60), designed to introduce a
NotI site at the 3' end of the promoter.

TTGGTGGAGGCTAGCCTAAGTACGTACTC (SEQ ID NO:59) 

AAGCGGCCGCGGTGATGACTG (SEQ ID NO:60) 

The resulting PCR fragment was subcloned into the intermediate cloning
vector pCR-Script AMP SK(+) (Stratagene) according to the manufacturer's protocol to
give plasmid pSGly12. Construction of pKR142, containing the legA2/NotI/legA23'
cassette, is described in Example 2. The gene for the M. alpina delta-5 desaturase
was PCR-amplified as described in Example 3, digested with NotI and cloned into
the NotI site of vector pKR124 (described in Example 2) to give pKR136.
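Because pKR274 is assembled from several intermediates, it can help to write the lineage down as a small graph and query it. The sketch below is an illustrative bookkeeping aid, not part of the disclosure: the plasmid names and parent relationships are taken from the description in this example, while the `PARENTS` mapping and the `ancestors` function are hypothetical names. PCR-amplified inserts are not represented as nodes.

```python
# Each plasmid maps to the plasmids that contributed a fragment or backbone,
# as described in Example 4.
PARENTS = {
    "pKR274": ["pKR272", "pKR136"],
    "pKR272": ["pKR270", "pKR269"],
    "pKR269": ["pKR159", "pKR197"],
    "pKR197": ["pKS102", "pKR72"],
    "pKR270": ["pKR263"],
    "pKR263": ["pKR142", "pSGly12"],
    "pKR136": ["pKR124"],
}


def ancestors(plasmid, parents=PARENTS):
    """Return a sorted list of every plasmid upstream of the given construct."""
    seen = []
    stack = list(parents.get(plasmid, []))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.append(current)
            stack.extend(parents.get(current, []))
    return sorted(seen)


if __name__ == "__main__":
    print(ancestors("pKR274"))
```

Querying an intermediate, for example `ancestors("pKR272")`, lists only the constructs needed to rebuild that intermediate, which is useful when one cassette is swapped as in the following example.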
EXAMPLE 5

Assembling EPA Biosynthetic Pathway Genes for Expression in Somatic Soybean
Embryos and Soybean Seeds (pKKE2)

The S. diclina delta-6 desaturase, M. alpina elongase and M. alpina delta-5
desaturase were cloned into plasmid pKKE2 (Figure 4) behind strong, seed-specific
promoters allowing for high expression of these genes in somatic soybean embryos
and soybean seeds. Plasmid pKKE2 was identical to pKR274, described in
Example 4, except that in pKKE2 the M. alpina delta-6 desaturase was replaced
with the S. diclina delta-6 desaturase. As in pKR274, the S. diclina delta-6
desaturase was cloned behind the promoter for the α' subunit of β-conglycinin
followed by the 3' transcription termination region of the phaseolin gene
(βcon/Sdd6/Phas3' cassette).

Plasmid pKKE2 was constructed from a number of different intermediate
cloning vectors as follows: The βcon/Sdd6/Phas3' cassette was released from
plasmid pKS208 (described in Example 3) by digestion with HindIII and was cloned
into the HindIII site of plasmid pKR272 (Example 4) to give pKR301. The
KTi/Mad5/KTi3' cassette, released from pKR136 (Example 4) by digestion with
BsiWI, was then cloned into the BsiWI site of pKR301 to give pKKE2.

EXAMPLE 6 

Cloning of S. diclina (ATCC 56851) delta-17 desaturase

Construction of Saprolegnia diclina (ATCC 56851) cDNA Library

To isolate genes encoding functional desaturase enzymes, a cDNA library
was constructed. Saprolegnia diclina cultures were grown in potato dextrose media
(Difco # 336, BD Diagnostic Systems, Sparks, MD) at room temperature for four
days with constant agitation. The mycelia were harvested by filtration through
several layers of cheesecloth, and the cultures were crushed in liquid nitrogen using
a mortar and pestle. The cell lysates were resuspended in RT buffer (Qiagen,
Valencia, California) containing β-mercaptoethanol and incubated at 55°C for
three minutes. These lysates were homogenized either by repeated aspirations
through a syringe or over a "Qiashredder"-brand column (Qiagen). The total RNA
was finally purified using the "RNeasy Maxi"-brand kit (Qiagen), as per the
manufacturer's protocol.

mRNA was isolated from total RNA from each organism using an oligo dT
cellulose resin. The "pBluescript II XR"-brand library construction kit (Stratagene, La
Jolla, CA) was used to synthesize double-stranded cDNA. The double-stranded
cDNA was then directionally cloned (5' EcoRI/3' XhoI) into pBluescript II SK(+)
vector (Stratagene). The S. diclina library contained approximately 2.5 x 10^6 clones,
each with an average insert size of approximately 700 bp. Genomic DNA of S.
diclina was isolated by crushing the culture in liquid nitrogen followed by purification
using the "Genomic DNA Extraction"-brand kit (Qiagen), as per the manufacturer's
protocol.

Determination of Codon Usage in Saprolegnia diclina

The 5' ends of 350 random cDNA clones were sequenced from the
Saprolegnia diclina cDNA library described above. The sequences were translated
into six reading frames using the GCG program (Genetics Computer Group, Madison,
WI) with the "FastA"-brand algorithm to search for similarity between a query
sequence and a group of sequences of the same type, specifically within the
GenBank database. Many of the clones were identified as putative housekeeping
genes based on protein homology to known genes. Eight S. diclina cDNA
sequences were thus selected. Additionally, the full-length S. diclina delta 5-
desaturase and delta 6-desaturase sequences were also used (see Table 4 below).
These sequences were then used to generate the S. diclina codon bias table shown
in Table 5 below by employing the "CodonFrequency" program from GCG (Madison,
WI).
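The kind of tabulation described above (most-used codon per amino acid and the percentage of the time it is used) can be sketched in a few lines. This is not the GCG "CodonFrequency" program, whose internals are not described here; it is a minimal stand-in under the assumption that the input sequences are in-frame coding sequences. The function name `codon_bias` and the toy sequence in the usage example are hypothetical.

```python
from collections import Counter, defaultdict

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codon_bias(sequences):
    """Return {amino_acid: (preferred_codon, percent_used)}.

    Each sequence is read in frame from its first base; a trailing partial
    codon and any codon containing non-ACGT characters are ignored.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(0, len(seq) - len(seq) % 3, 3):
            codon = seq[i:i + 3]
            if codon in CODON_TABLE:
                counts[codon] += 1

    by_amino_acid = defaultdict(dict)
    for codon, n in counts.items():
        by_amino_acid[CODON_TABLE[codon]][codon] = n

    bias = {}
    for amino_acid, codons in by_amino_acid.items():
        best = max(codons, key=codons.get)
        bias[amino_acid] = (best, 100.0 * codons[best] / sum(codons.values()))
    return bias


if __name__ == "__main__":
    # Toy in-frame sequence (hypothetical, for illustration only).
    toy = ["ATGGCCGCCGCTAAGTGA"]
    for amino_acid, (codon, pct) in sorted(codon_bias(toy).items()):
        print(amino_acid, codon, f"{pct:.0f}%")
```

Stop codons are reported under the symbol `*`, which corresponds to the "Stop" row of a codon bias table.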

TABLE 4

Clone   Database Match                              # bases   # amino acids
3       Actin gene                                  615       205
20      Ribosomal protein L23                       420       140
55      Heat Shock protein 70 gene                  468       156
83      Glyceraldehyde-3-P-dehydrogenase gene       588       196
138     Ribosomal protein S13 gene                  329       110
179     Alpha-tubulin 3 gene                        591       197
190     Casein kinase II alpha subunit gene         627       209
250     Cyclophilin gene                            489       163
        Delta 6-desaturase                          1362      453
        Delta 5-desaturase                          1413      471
        Total                                       6573      2191

TABLE 5

CODON BIAS TABLE FOR Saprolegnia diclina

Amino acid   Codon Bias   % used
Ala          GCC          55%
Arg          CGC          50%
Asn          AAC          94%
Asp          GAC          85%
Cys          TGC          77%
Gln          CAG          90%
Glu          GAG          80%
Gly          GGC          67%
His          CAC          86%
Ile          ATC          82%
Leu          CTC          52%
Lys          AAG          87%
Met          ATG          100%
Phe          TTC          72%
Pro          CCG          55%
Ser          TCG          47%
Thr          ACG          46%
Trp          TGG          100%
Tyr          TAC          90%
Val          GTC          73%
Stop         TGA          67%

Design of Degenerate Oligonucleotides for the Isolation of an Omega-3 Desaturase
from Saprolegnia diclina (ATCC 56851)

The method for identification of a delta-17 desaturase (an omega-3
desaturase) gene from S. diclina involved PCR amplification of a region of the
putative desaturase gene using degenerate oligonucleotides (primers) that
contained conserved motifs present in other known omega-3 desaturases. Omega-3
desaturases from the following organisms were used for the design of these
degenerate primers: Arabidopsis thaliana (Swissprot # P46310), Ricinus communis
(Swissprot # P48619), Glycine max (Swissprot # P48621), Sesamum indicum
(Swissprot # P48620), Nicotiana tabacum (GenBank # D79979), Perilla frutescens
(GenBank # U59477), Capsicum annuum (GenBank # AF222989), Limnanthes
douglasii (GenBank # U17063), and Caenorhabditis elegans (GenBank # L41807).
Some primers were designed to contain the conserved histidine-box motifs that are
important components of the active site of the enzymes. See Shanklin, J.E.,
McDonough, V.M., and Martin, C.E. (1994) Biochemistry 33, 12787-12794.

Alignment of sequences was carried out using the CLUSTALW Multiple
Sequence Alignment Program (Thompson, J.D. et al. (1994) Nucl. Acids Res.
22:4673-4680).

The following degenerate primers were designed and used in various
combinations:

Protein Motif 1: NH3- TRAAIPKHCWVK -COOH (SEQ ID NO:61)

Primer RO 1144 (Forward):
ATCCGCGCCGCCATCCCCAAGCACTGCTGGGTCAAG (SEQ ID NO: 62)

Protein Motif 2: NH3- ALFVLGHDCGHGSFS -COOH (SEQ ID NO:63)

This primer contains the histidine-box 1 (underlined).

Primer RO 1119 (Forward):
GCCCTCTTCGTCCTCGGCCAYGACTGCGGCCAYGGCTCGTTGTCG
(SEQ ID NO: 64).

Primer RO 1118 (Reverse):
GAGRTGGTARTGGGGGATCTGGGGGAAGARRTGRTGGRYGACRTG
(SEQ ID NO: 65).

Protein Motif 3: NH3- PYHGWRIS HRTHH QN -COOH (SEQ ID NO:66)

This primer contains the histidine-box 2 (underlined).

Primer RO 1121 (Forward):
CCCTACCAYGGCTGGCGCATCTCGCAYCGCACCCAYCAYCAGAAC
(SEQ ID NO: 67).

Primer RO 1122 (Reverse):
GTTCTGRTGRTGGGTCCGRTGCGAGATGCGCCAGCCRTGGTAGGG
(SEQ ID NO: 68).

Protein Motif 4: NH3- GSHF D/H P D/Y SDLFV -COOH (SEQ ID NO:69)

Primer RO 1146 (Forward):
GGCTCGCACTTCSACCCCKACTCGGACCTCTTCGTC (SEQ ID NO: 70).

Primer RO 1147 (Reverse):
GACGAAGAGGTCCGAGTMGGGGTWGAAGTGCGAGCC (SEQ ID NO: 71).

Protein Motif 5: NH3- WS Y/F LTV RGGLTT L/I DR -COOH (SEQ ID NO:72)

Primer RO 1148 (Reverse):
GCGCTGGAKGGTGGTGAGGCCGCCGCGGAWGSACGACCA
(SEQ ID NO: 73)

Protein Motif 6: NH3- HHDIGTHVIHHLFPQ -COOH (SEQ ID NO:74)

This sequence contains the third histidine-box (underlined).

Primer RO 1114 (Reverse):
CTGGGGGAAGAGRTGRTGGATGACRTGGGTGCCGATGTCRTGRTG
(SEQ ID NO: 75).

Protein Motif 7: NH3- H L/F FP Q/K IPHYHL V/I EAT -COOH (SEQ ID NO:76)

Primer RO 1116 (Reverse):
GGTGGCCTCGAYGAGRTGGTARTGGGGGATCTKGGGGAAGARRTG
(SEQ ID NO: 77).

Protein Motif 8: NH3- HV A/I HH L/F FPQIPHYHL -COOH (SEQ ID NO:78)

This primer contains the third histidine-box (underlined) and accounts for differences
between the plant omega-3 desaturases and the C. elegans omega-3 desaturase.

The nucleic acid degeneracy code used for SEQ ID NOS: 62 through 77 was as
follows: R=A/G; Y=C/T; M=A/C; K=G/T; W=A/T; S=C/G; B=C/G/T; D=A/G/T;
H=A/C/T; V=A/C/G; and N=A/C/G/T.
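The degeneracy code above determines how many distinct sequences each degenerate primer represents. The sketch below applies that code to count and enumerate the variants of a primer. It is illustrative only; the function names `degeneracy` and `expand` are hypothetical, and the mapping follows the code listed in the text.

```python
from itertools import product

# Degeneracy code as listed in the text, plus the four unambiguous bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "M": "AC", "K": "GT", "W": "AT", "S": "CG",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer):
    """Number of distinct unambiguous sequences a degenerate primer encodes."""
    total = 1
    for base in primer.upper():
        total *= len(IUPAC[base])
    return total


def expand(primer):
    """List every unambiguous sequence encoded by a degenerate primer."""
    choices = (IUPAC[base] for base in primer.upper())
    return ["".join(combo) for combo in product(*choices)]


if __name__ == "__main__":
    ro1146 = "GGCTCGCACTTCSACCCCKACTCGGACCTCTTCGTC"  # SEQ ID NO: 70
    print(degeneracy(ro1146))
    for variant in expand(ro1146):
        print(variant)
```

Enumerating with `expand` is practical only for low degeneracy; for primers with many ambiguous positions, `degeneracy` alone gives the pool size without building the list.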

Identification and Isolation of Delta-17 Desaturase Gene from Saprolegnia diclina
(ATCC 56851)

Various sets of the degenerate primers above were used in PCR
amplification reactions, using as a template either the S. diclina cDNA library
plasmid DNA or S. diclina genomic DNA. Various DNA polymerases
and reaction conditions were also used for the PCR amplifications. These reactions
variously involved using "Platinum Taq"-brand DNA polymerase (Life Technologies
Inc., Rockville, MD), or cDNA polymerase (Clontech, Palo Alto, CA), or Taq PCR-
mix (Qiagen), at differing annealing temperatures.

PCR amplification using the primers RO 1121 (Forward) (SEQ ID NO:67)
and RO 1116 (Reverse) (SEQ ID NO:77) resulted in the amplification of a fragment
homologous to a known omega-3 desaturase. The RO 1121 (Forward) primer
corresponds to protein motif 3; the RO 1116 (Reverse) primer corresponds to
protein motif 7.

PCR amplification was carried out in a 50 µl total volume containing: 3 µl of
the cDNA library template, PCR buffer containing 40 mM Tricine-KOH (pH 9.2),
15 mM KOAc, 3.5 mM Mg(OAc)2, 3.75 µg/ml BSA (final concentration), 200 µM
each deoxyribonucleotide triphosphate, 10 pmole of each primer and 0.5 µl of
"Advantage"-brand cDNA polymerase (Clontech). Amplification was carried out as
follows: initial denaturation at 94°C for 3 minutes, followed by 35 cycles of the
following: 94°C for 1 min, 60°C for 30 sec, 72°C for 1 min. A final extension cycle of
72°C for 7 min was carried out, followed by reaction termination at 4°C.
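The programmed hold time of the cycling profile above follows directly from the listed steps. The arithmetic can be sketched as below; this is an illustration only, the function name `program_minutes` is hypothetical, and the estimate excludes temperature ramping between steps and the final 4°C hold, both of which depend on the instrument.

```python
def program_minutes(initial, cycles, steps, final):
    """Total programmed hold time in minutes for a simple PCR profile.

    initial: initial denaturation time
    cycles:  number of cycles
    steps:   per-cycle hold times (denature, anneal, extend)
    final:   final extension time
    """
    return initial + cycles * sum(steps) + final


if __name__ == "__main__":
    # 94°C 3 min; 35 x (94°C 1 min, 60°C 30 sec, 72°C 1 min); 72°C 7 min
    total = program_minutes(initial=3.0, cycles=35, steps=[1.0, 0.5, 1.0], final=7.0)
    print(total)  # 97.5
```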

A single ~480 bp PCR band was generated, which was resolved on a 1%
"SeaKem Gold"-brand agarose gel (FMC BioProducts, Rockland, ME) and
gel-purified using the Qiagen Gel Extraction Kit. The staggered ends on the fragment
were "filled-in" using T4 DNA polymerase (Life Technologies, Rockville, MD) as per
the manufacturer's instructions, and the DNA fragments were cloned into the
PCR-Blunt vector (Invitrogen, Carlsbad, CA). The recombinant plasmids were
transformed into TOP10 supercompetent cells (Invitrogen), eight clones were
sequenced, and a database search (Gen-EMBL) was carried out.

Clones "sdd17-7-1" to "sdd17-7-8" were all found to contain an ~483 bp
insert. The deduced amino acid sequence from this fragment showed highest
identity to the following proteins (based on a "tFastA" search):

1. 37.9% identity in 161 amino acid overlap with an omega-3 (delta-15)
desaturase from Synechocystis sp. (Accession # D13780).

2. 40.7% identity in 113 amino acid overlap with Picea abies plastidic
omega-3 desaturase (Accession # AJ302017).

3. 35.9% identity in 128 amino acid overlap with Zea mays FAD8 fatty acid
desaturase (Accession # D63953).

Based on its sequence homology to known omega-3 fatty acid desaturases, it
seemed likely that this DNA fragment was part of a delta-17 desaturase gene
present in S. diclina.
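Figures of the form "X% identity in N amino acid overlap" can be read as the fraction of aligned positions that carry the same residue. The sketch below computes that quantity for two pre-aligned sequences. It is an assumption-laden illustration, not the tFastA implementation: here the overlap counts only positions where neither sequence has a gap, the function name `percent_identity` is hypothetical, and the two short aligned strings in the usage example are made up.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Return (percent identity, overlap length) for two aligned sequences.

    Both inputs must have the same length; positions where either sequence
    has a gap character are excluded from the overlap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != gap and y != gap]
    if not pairs:
        raise ValueError("no overlapping positions")
    matches = sum(x == y for x, y in pairs)
    return 100.0 * matches / len(pairs), len(pairs)


if __name__ == "__main__":
    # Hypothetical aligned fragments, for illustration only.
    pct, overlap = percent_identity("HDCGH-GS", "HECGHAGS")
    print(f"{pct:.1f}% identity in {overlap} amino acid overlap")
```

Under this reading, 37.9% identity over a 161 amino acid overlap corresponds to roughly 61 identical positions.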

The DNA sequence identified above was used to design oligonucleotides
to isolate the 3' and the 5' ends of this gene from the S. diclina cDNA library. To
isolate the 3' end of the gene, the following oligonucleotides were designed:

RO 1188 (Forward): 5'-TACGCGTACCTCACGTACTCGCTCG-3' (SEQ ID NO: 79)

RO 1189 (Forward): TTCTTGCACCACAACGACGAAGCGACG (SEQ ID NO: 80)

RO 1190 (Forward): GGAGTGGACGTACGTCAAGGGCAAC (SEQ ID NO: 81)

RO 1191 (Forward): TCAAGGGCAACCTCTCGAGCGTCGAC (SEQ ID NO: 82)

These primers (SEQ ID NOS: 79-82) were used in combinations with the
pBluescript SK(+) vector oligonucleotide:

RO 898: 5'-CCCAGTCACGACGTGTAAAACGACGGCCAG-3' (SEQ ID NO: 83).

PCR amplifications were carried out using either the "Taq PCR Master Mix"-brand
polymerase (Qiagen) or "Advantage"-brand cDNA polymerase (Clontech) or
"Platinum"-brand Taq DNA polymerase (Life Technologies), as follows:

For the "Taq PCR Master Mix" polymerase, 10 pmoles of each primer were
used along with 1 µl of the cDNA library DNA from Example 1. Amplification was
carried out as follows: initial denaturation at 94°C for 3 min, followed by 35 cycles of
the following: 94°C for 1 min, 60°C for 30 sec, 72°C for 1 min. A final extension
cycle of 72°C for 7 min was carried out, followed by the reaction termination at 4°C.
This amplification resulted in the most distinct bands as compared with the other
two conditions tested.

For the "Advantage"-brand cDNA polymerase reaction, PCR amplification
was carried out in a 50 µl total volume containing: 1 µl of the cDNA library template
from Example 1, PCR buffer containing 40 mM Tricine-KOH (pH 9.2), 15 mM KOAc,
3.5 mM Mg(OAc)2, 3.75 µg/ml BSA (final concentration), 200 µM each
deoxyribonucleotide triphosphate, 10 pmole of each primer and 0.5 µl of cDNA
polymerase (Clontech). Amplification was carried out as described for the Taq PCR
Master Mix.

For the "Platinum"-brand Taq DNA polymerase reaction, PCR amplification
was carried out in a 50 µl total volume containing: 1 µl of the cDNA library template
from Example 1, PCR buffer containing 20 mM Tris-Cl, pH 8.4, 50 mM KCl (final
concentration), 200 µM each deoxyribonucleotide triphosphate, 10 pmole of each
primer, 1.5 mM MgSO4, and 0.5 µl of Platinum Taq DNA polymerase. Amplification
was carried out as follows: initial denaturation at 94°C for 3 min, followed by
30 cycles of the following: 94°C for 45 sec, 55°C for 30 sec, 68°C for 2 min. The
reaction was terminated at 4°C.

All four sets of primers in combination with the vector primer generated
distinct bands. PCR bands from the combination (RO 1188 + RO 898) were
>500 bp and were gel-purified and cloned separately. The PCR bands
generated from the other primer combinations were <500 bp. These bands were
gel-purified, pooled together, and cloned into PCR-Blunt vector (Invitrogen) as
described earlier. The recombinant plasmids were transformed into TOP10
supercompetent cells (Invitrogen) and clones were sequenced and analyzed.

Clones "sdd17-16-4" and "sdd16-6," containing the ~500 bp insert, and clones
"sdd17-17-6," "sdd17-17-10," and "sdd17-20-3," containing the ~400 bp inserts,
were all found to contain the 3'-end of the putative delta-17 desaturase. These
sequences overlapped with each other, as well as with the originally identified
fragment of this putative omega-3 desaturase gene. All of the sequences contained
the 'TAA' stop codon and a poly-A tail typical of 3'-ends of eukaryotic genes. This
3'-end sequence was homologous to other known omega-3 desaturases, sharing
the highest identity (41.5% in 130 amino acid overlap) with the Synechocystis
delta-15 desaturase (Accession # D13780).

For the isolation of the 5'-end of this gene, the following oligonucleotides
were designed and used in combinations with the pBluescript SK(+) vector
oligonucleotide:

RO 899: 5'-AGGGGATAAGAATTTGAGAGAGGAAAGAGG-3' (SEQ ID NO: 84)

RO 1185 (Reverse): GGTAAAAGATGTGGTGGTTGTGGATGTTGG (SEQ ID NO: 85)

RO 1186 (Reverse): 5'-GTCAAAGTGGCTCATCGTGC-3' (SEQ ID NO: 86)

RO 1187 (Reverse): CGAGCGAGTACGTGAGGTACGCGTAC (SEQ ID NO: 87)

Amplifications were carried out using either the "Taq PCR Master Mix"-brand
polymerase (Qiagen) or the "Advantage"-brand cDNA polymerase (Clontech) or the
"Platinum"-brand Taq DNA polymerase (Life Technologies), as described
hereinabove for the 3' end isolation.

PCR bands generated from primer combinations (RO 1185 or RO 1186 + RO
899) were between ~580 and ~440 bp, and these were pooled and cloned into
PCR-Blunt vector as described above. Clones thus generated included "sdd17-14-1,"
"sdd17-14-10," "sdd17-18-2," and "sdd17-18-8," all of which showed homology with
known omega-3 desaturases.

Additionally, bands generated from (RO 1187 + RO 899) were ~680 bp, and
these were cloned separately to generate clones "sdd17-22-2" and "sdd17-22-5,"
which also showed homology with known omega-3 desaturases. All these clones
overlapped with each other, as well as with the original fragment of the unknown
putative delta-17 desaturase. These sequences contained an 'ATG' site followed by
an open reading frame, indicating that it is the start site of this gene. These
fragments showed highest identity (39.7% in 146 amino acid overlap) with the
delta-15 desaturase from Calendula officinalis (Accession # AJ245938).

The full-length reading frame for this delta-17 desaturase was obtained by
PCR amplification of the S. diclina cDNA library using the following oligonucleotides:

RO 1212 (Forward):
5'-TCAAGAGAATTCATGAGGGAGGATAAGAGGAAGGTGGAGTTGCGG-3'
(SEQ ID NO: 88)

This primer contains the 'ATG' start site (single underline) followed by the 5'
sequence of the omega-3 desaturase. In addition, an EcoRI site (double underline)
was introduced upstream of the start site to facilitate cloning into the yeast
expression vector pYX242.

RO 1213 (Reverse):
5'-AAAAGAAAGGTTGGGTTCCTAGTGHAGTGGGAGTTGGGGTTGGG-3'
(SEQ ID NO: 89)

This primer contains the 'TAA' stop codon (single underline) of the gene as well as
sequence downstream from the stop codon. This sequence was included because
regions within the gene were very G+C rich, and thus could not be included in the
design of oligonucleotides for PCR amplification. In addition, a HindIII site (double
underline) was included for convenient cloning into the yeast expression vector
pYX242.

PCR amplification was carried out using the "Taq PCR Master Mix"-brand
polymerase (Qiagen), 10 pmoles of each primer, and 1 µl of the cDNA library DNA
from Example 1. Amplification was carried out as follows: initial denaturation at
94°C for 3 min, followed by 35 cycles of the following: 94°C for 1 min, 50°C for 30
sec, 72°C for 1 min. A final extension cycle of 72°C for 7 min was carried out,
followed by the reaction termination at 4°C.

A PCR band of ~1 kb was thus obtained, and this band was isolated, purified,
cloned into PCR-Blunt vector (Invitrogen), and transformed into TOP10 cells. The
inserts were sequenced to verify the gene sequence. Clone "sdd17-27-2" was
digested with EcoRI/HindIII to release the full-length insert, and this insert was
cloned into yeast expression vector pYX242, previously digested with EcoRI/
HindIII. This construct contained 1077 bp of sdd17 cloned into pYX242. This
construct was labeled pRSP19.

EXAMPLE 7

Assembly of EPA Biosynthetic Pathway Genes for Expression in Somatic Soybean
Embryos and Soybean Seeds (pKR275)

The Arabidopsis Fad3 gene [Yadav, N.S. et al. (1993) Plant Physiol.
103:467-76] and S. diclina delta-17 desaturase were cloned into plasmid pKR275
(Figure 5) behind strong, seed-specific promoters allowing for high expression of
these genes in somatic soybean embryos and soybean seeds. The Fad3 gene
(SEQ ID NO:47; its protein translation product is SEQ ID NO:48) was cloned
behind the KTi promoter, and upstream of the KTi 3' termination region
(KTi/Fad3/KTi3' cassette). The S. diclina delta-17 desaturase was cloned behind
the soybean annexin promoter followed by the soy BD30 3' termination region
(Ann/Sdd17/BD30 cassette). Plasmid pKR275 also contains a mutated form of the
soy acetolactate synthase (ALS) that is resistant to sulfonylurea herbicides. ALS
catalyzes the first common step in the biosynthesis of the branched chain amino
acids isoleucine, leucine, and valine (Keeler et al., Plant Physiol 1993 102:1009-18).
Inhibition of native plant ALS by several classes of structurally unrelated herbicides,
including sulfonylureas, imidazolinones, and triazolopyrimidines, is lethal (Chong
CK, Choi JD, Biochem Biophys Res Commun 2000 279:462-7). Overexpression of
the mutated sulfonylurea-resistant ALS gene allows for selection of transformed
plant cells on sulfonylurea herbicides. The ALS gene is cloned behind the SAMS
promoter (described in WO 00/37662). This expression cassette is set forth in SEQ
ID NO:90. In addition, plasmid pKR275 contains a bacterial ori region and the
T7prom/HPT/T7term cassette for replication and selection of the plasmid on
hygromycin B in bacteria.

Plasmid pKR275 was constructed from a number of different intermediate
cloning vectors as follows: The KTi/Fad3/KTi3' cassette was released from plasmid
pKR201 by digestion with BsiWI and was cloned into the BsiWI site of plasmid
pKR226, containing the ALS gene for selection, the T7prom/hpt/T7term cassette
and the bacterial ori region. This was designated plasmid pKR273. The
Ann/Sdd17/BD30 cassette, released from pKR271 by digestion with PstI, was then
cloned into the SbfI site of pKR273 to give pKR275. A detailed description of
plasmid construction for pKR226, pKR201 and pKR271 is provided below.

Plasmid pKR226 was constructed by digesting pKR218 with BsiWI to remove 
the legA2/NotI/legA23' cassette. Plasmid pKR218 was made by combining the filled 
HindIII/SbfI fragment of pKR217, containing the legA2/NotI/legA23' cassette, the 
bacterial ori and the T7prom/HPT/T7term cassette, with the PstI/SmaI fragment of 
pZSLISIeuB, containing the SAMS/ALS/ALS3' cassette. Plasmid pKR217 was 
constructed by cloning the BamHI/HindIII fragment of pKR142 (described in 
Example 2), containing the legA2/NotI/legA23' cassette, into the BamHI/HindIII site 
of KS102. The Arabidopsis Fad3 gene was released from vector pKS131 as a NotI 
fragment and cloned into the NotI site of pKR124 (described in Example 2) to form 
pKR201. The NotI fragment from pKS131 is identical to that from pCF3 [Yadav, 
N.S. et al. (1993) Plant Physiol. 103:467-76]. 

The gene for the S. diclina delta-17 desaturase was released from 
pRSP19/pGEM (described in Example 2) by partial digestion with NotI, and it was 
then cloned into the NotI site of pKR268 to give pKR271. Vector pKR268 contains 
a NotI site flanked by the annexin promoter and the BD30 3' transcription 
termination region (Ann/NotI/BD30 cassette). In addition, the Ann/NotI/BD30 
cassette was flanked by PstI sites. 

To construct pKR268, the annexin promoter from pJS92 was released by 
BamHI digestion and the ends were filled. The resulting fragment was ligated into 
the filled BsiWI fragment of pKR124 (described in Example 2), containing the 
bacterial ori and ampicillin resistance gene, to give pKR265. This cloning step 
added SbfI, PstI and BsiWI sites to the 5' end of the annexin promoter. The 
annexin promoter was released from pKR265 by digestion with SbfI and NotI and 
was cloned into the SbfI/NotI fragment of pKR256, containing the BD30 3' 
transcription terminator, an ampicillin resistance gene and a bacterial ori region, to 
give pKR268. Vector pKR256 was constructed by cloning an EcoRI/NotI fragment 
from pKR251r, containing the BD30 3' transcription terminator, into the EcoRI/NotI 
fragment of intermediate cloning vector pKR227. This step also added a PstI site to 
the 3' end of the BD30 3' transcription terminator. Plasmid pKR227 was derived by 
ligating the SalI fragment of pJS93 containing soy BD30 promoter (WO 01/68887) 
with the SalI fragment of pUC19. The BD30 3' transcription terminator was PCR- 
amplified from soy genomic DNA using primer oSBD30-1 (SEQ ID NO:91), designed 
to introduce a NotI site at the 5' end of the terminator, and primer oSBD30-2 (SEQ 
ID NO:92), designed to introduce a BsiWI site at the 3' end of the terminator. 

TGCGGCCGCATGAGCCG (SEQ ID NO:91) 



ACGTACGGTACCATCTGCTAATATTTTAAATC (SEQ ID NO:92) 

The resulting PCR fragment was subcloned into the intermediate cloning vector 
pCR-Script AMP SK(+) (Stratagene) according to the manufacturer's protocol to give 
plasmid pKR251r. 
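
As an illustrative check, not part of the original protocol, the two primer sequences given above can be scanned for the restriction sites they are said to introduce. The sketch below assumes the standard recognition sequences GCGGCCGC (NotI) and CGTACG (BsiWI); the function and variable names are hypothetical.

```python
# Recognition sequences (5'->3'); both are palindromic, so scanning one
# strand is sufficient.
RECOGNITION_SITES = {
    "NotI": "GCGGCCGC",
    "BsiWI": "CGTACG",
}

# Primer sequences as printed in the text.
PRIMERS = {
    "oSBD30-1": "TGCGGCCGCATGAGCCG",                 # SEQ ID NO:91
    "oSBD30-2": "ACGTACGGTACCATCTGCTAATATTTTAAATC",  # SEQ ID NO:92
}


def find_sites(seq, sites=RECOGNITION_SITES):
    """Return {enzyme: [0-based start positions]} for every site found in seq."""
    seq = seq.upper()
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            start = seq.find(motif, start + 1)
        if positions:
            hits[enzyme] = positions
    return hits


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, find_sites(seq))
```

Under these assumptions each primer carries exactly one of the two sites, one base in from its 5' end, which is consistent with the stated design.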

EXAMPLE 8 

Assembling EPA biosynthetic pathway genes for expression in Somatic Soybean 
Embryos - pKR328 & pKR329 
The EPA biosynthetic genes were tested in combination in order to assess 
their combined activities in somatic soybean embryos before large-scale production 
transformation into soybean. Each gene was cloned into an appropriate expression 
cassette as described below. 

Plasmid pKR329 was similar to pKR275, described in detail in Example 7, in 
that it contained the same KTi/Fad3/KTi3' and Ann/Sdd17/BD30 cassettes allowing 
for strong, seed-specific expression of the Arabidopsis Fad3 and S. diclina delta-17 
desaturase genes. It also contained the T7prom/HPT/T7term cassette and a 
bacterial ori. Plasmid pKR329 differed from pKR275 in that it contained the 
hygromycin phosphotransferase gene cloned behind the 35S promoter followed by 
the NOS 3' untranslated region (35S/HPT/NOS3' cassette) instead of the 
SAMS/ALS/ALS3' cassette. The 35S/HPT/NOS3' cassette allowed for selection of 
transformed plant cells on hygromycin-containing media. 

Plasmid pKR329 was constructed in many steps from a number of different 
intermediate cloning vectors. The KTi/Fad3/KTi3' cassette was released from 
plasmid pKR201 (Example 7) by digestion with BsiWI and was cloned into the 
BsiWI site of plasmid pKR325, containing the 35S/HPT/NOS3' cassette, the 
T7prom/hpt/T7term cassette and bacterial ori. This was called plasmid pKR327. 
The Ann/Sdd17/BD30 cassette, released from pKR271 (Example 7) by digestion 
with PstI, was then cloned into the SbfI site of pKR327 to give pKR329. Plasmid 
pKR325 was generated from pKR72 (Example 4) by digestion with HindIII to 
remove the βcon/NotI/Phas3' cassette. 

Plasmid pKR328 was identical to pKR329, described above, except that it did 
not contain the KTi/Fad3/KTi3' cassette. The Ann/Sdd17/BD30 cassette, released 
from pKR271 (Example 7) by digestion with PstI, was cloned into the SbfI site of 
pKR325 (described above) to give pKR328. 

EXAMPLE 9 

Transformation of Somatic Soybean Embryo Cultures 

Culture Conditions 

Soybean embryogenic suspension cultures (cv. Jack) were maintained in 
35 ml liquid medium SB196 (see recipes below) on a rotary shaker at 150 rpm and 
26°C with cool white fluorescent lights on a 16:8 hr day/night photoperiod at a light 
intensity of 60-85 μE/m2/s. Cultures were subcultured every 7 days to two weeks by 
inoculating approximately 35 mg of tissue into 35 ml of fresh liquid SB196 (the 
preferred subculture interval is every 7 days). 

Soybean embryogenic suspension cultures were transformed with the 
plasmids and DNA fragments described in the following examples by the method of 
particle gun bombardment (Klein et al. 1987; Nature, 327:70). A DuPont Biolistic 
PDS1000/HE instrument (helium retrofit) was used for all transformations. 

Soybean Embryogenic Suspension Culture Initiation 

Soybean cultures were initiated twice each month with 5-7 days between 

each initiation. 

Pods with immature seeds from available soybean plants 45-55 days after 
planting were picked, removed from their shells and placed into a sterilized magenta 
box. The soybean seeds were sterilized by shaking them for 15 minutes in a 5% 
Clorox solution with 1 drop of Ivory soap (95 ml of autoclaved distilled water plus 
5 ml Clorox and 1 drop of soap, mixed well). Seeds were rinsed using two 1-liter 
bottles of sterile distilled water and those less than 4 mm were placed on individual 
microscope slides. The small end of the seed was cut and the cotyledons pressed 
out of the seed coat. Cotyledons were transferred to plates containing SB1 medium 
(25-30 cotyledons per plate). Plates were wrapped with fiber tape and stored for 
8 weeks. After this time secondary embryos were cut and placed into SB196 liquid 
media for 7 days. 

Preparation of DNA for Bombardment 

Either an intact plasmid or a DNA plasmid fragment containing the genes of 
interest and the selectable marker gene was used for bombardment. Plasmid DNA 
for bombardment was routinely prepared and purified using the method described in 
the Promega™ Protocols and Applications Guide, Second Edition (page 106). 
Fragments of pKR274 (Example 4), pKKE2 (Example 5) and pKR275 (Example 7) 
were obtained by gel isolation of double digested plasmids. In each case, 100 μg of 
plasmid DNA was digested in 0.5 ml of the specific enzyme mix described below. 
Plasmids pKR274 (Example 4) and pKKE2 (Example 5) were digested with AscI 
(100 units) and EcoRI (100 units) in NEBuffer 4 (20 mM Tris-acetate, 10 mM 
magnesium acetate, 50 mM potassium acetate, 1 mM dithiothreitol, pH 7.9), 
100 μg/ml BSA, and 5 mM beta-mercaptoethanol at 37°C for 1.5 hr. Plasmid 
pKR275 (Example 7) was digested with AscI (100 units) and SgfI (50 units) in 
NEBuffer 2 (10 mM Tris-HCl, 10 mM MgCl2, 50 mM NaCl, 1 mM dithiothreitol, 
pH 7.9), 100 μg/ml BSA, and 5 mM beta-mercaptoethanol at 37°C for 1.5 hr. The 
resulting DNA fragments were separated by gel electrophoresis on 1% SeaPlaque 
GTG agarose (BioWhittaker Molecular Applications) and the DNA fragments 
containing EPA biosynthetic genes were cut from the agarose gel. DNA was 
purified from the agarose using the GELase digesting enzyme following the 
manufacturer's protocol. 

A 50 μl aliquot of sterile distilled water containing 3 mg of gold particles 
was added to 5 μl of a 1 μg/μl DNA solution (either intact plasmid or DNA 
fragment prepared as described above), 50 μl of 2.5 M CaCl2 and 20 μl of 0.1 M 
spermidine. The mixture was shaken 3 min on level 3 of a vortex shaker and spun 
for 10 sec in a bench microfuge. After a wash with 400 μl of 100% ethanol the pellet 
was suspended by sonication in 40 μl of 100% ethanol. Five μl of DNA suspension 
was dispensed to each flying disk of the Biolistic PDS1000/HE instrument. 
Each 5 μl aliquot contained approximately 0.375 mg gold per bombardment (i.e. per 
disk). 
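
The per-disk gold figure follows from simple proportioning of the final 40 μl suspension into 5 μl aliquots. The sketch below reproduces that arithmetic; the DNA figure is only an upper bound under the assumption, not stated in the text, that all 5 μg of input DNA stays bound to the gold through the ethanol wash. The function name is hypothetical.

```python
def amount_per_shot(total_amount, suspension_volume_ul, aliquot_volume_ul):
    """Amount delivered per aliquot when total_amount is evenly suspended."""
    return total_amount * aliquot_volume_ul / suspension_volume_ul


# 3 mg gold resuspended in 40 ul ethanol, 5 ul loaded per flying disk.
gold_mg_per_shot = amount_per_shot(3.0, 40.0, 5.0)

# 5 ul of 1 ug/ul DNA = 5 ug input; upper bound assuming complete binding.
dna_ug_per_shot_max = amount_per_shot(5.0, 40.0, 5.0)

if __name__ == "__main__":
    print(f"gold per shot: {gold_mg_per_shot:.3f} mg")
    print(f"DNA per shot (upper bound): {dna_ug_per_shot_max:.3f} ug")
```

One 40 μl preparation therefore covers eight disks at the stated loading.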

Tissue Preparation and Bombardment with DNA 

Approximately 150-200 mg of 7 day old embryonic suspension cultures were 
placed in an empty, sterile 60 x 15 mm petri dish and the dish covered with plastic 
mesh. Tissue was bombarded 1 or 2 shots per plate with membrane rupture 
pressure set at 1100 PSI and the chamber evacuated to a vacuum of 27-28 inches 
of mercury. Tissue was placed approximately 3.5 inches from the retaining / 
stopping screen. 

Selection of Transformed Embryos 

Transformed embryos were selected either using hygromycin (when the 
hygromycin phosphotransferase, HPT, gene was used as the selectable marker) or 
chlorsulfuron (when the acetolactate synthase, ALS, gene was used as the 
selectable marker). 

Hygromycin (HPT) Selection 

Following bombardment, the tissue was placed into fresh SB196 media and 
cultured as described above. Six days post-bombardment, the SB196 was exchanged 
with fresh SB196 containing a selection agent of 30 mg/L hygromycin. The 
selection media was refreshed weekly. Four to six weeks post selection, green, 
transformed tissue may be observed growing from untransformed, necrotic 
embryogenic clusters. Isolated, green tissue was removed and inoculated into 
multiwell plates to generate new, clonally propagated, transformed embryogenic 
suspension cultures. 

Chlorsulfuron (ALS) Selection 

Following bombardment, the tissue was divided between 2 flasks with fresh 
SB196 media and cultured as described above. Six to seven days post- 
bombardment, the SB196 was exchanged with fresh SB196 containing selection 
agent of 100 ng/ml chlorsulfuron. The selection media was refreshed weekly. Four 
to six weeks post selection, green, transformed tissue may be observed growing 
from untransformed, necrotic embryogenic clusters. Isolated, green tissue was 
removed and inoculated into multiwell plates containing SB196 to generate new, 
clonally propagated, transformed embryogenic suspension cultures. 
Regeneration of Soybean Somatic Embryos into Plants 

In order to obtain whole plants from embryogenic suspension cultures, the 
tissue must be regenerated. 

Embryo Maturation 

Embryos were cultured for 4-6 weeks at 26°C in SB196 under cool white 
fluorescent (Phillips cool white Econowatt F40/CW/RS/EW) and Agro (Phillips F40 
Agro) bulbs (40 watt) on a 16:8 hr photoperiod with light intensity of 90-120 μE/m2/s. 
After this time embryo clusters were removed to a solid agar media, SB166, for 
1-2 weeks. Clusters were then subcultured to medium SB103 for 3 weeks. During 
this period, individual embryos can be removed from the clusters and screened for 
alterations in their fatty acid compositions as described in Example 11. It should be 
noted that any detectable phenotype, resulting from the expression of the genes of 
interest, could be screened at this stage. This would include, but not be limited to, 
alterations in fatty acid profile, protein profile and content, carbohydrate content, 
growth rate, viability, or the ability to develop normally into a soybean plant. 
Embryo Desiccation and Germination 

Matured individual embryos were desiccated by placing them into an empty, 
small petri dish (35 x 10 mm) for approximately 4-7 days. The plates were sealed 
with fiber tape (creating a small humidity chamber). Desiccated embryos were 
planted into SB71-4 medium where they were left to germinate under the same 
culture conditions described above. Germinated plantlets were removed from 
germination medium, rinsed thoroughly with water and then planted in Redi-Earth 
in a 24-cell pack tray covered with a clear plastic dome. After 2 weeks the dome 
was removed and plants were hardened off for a further week. If plantlets looked 
hardy they were transplanted to a 10" pot of Redi-Earth with up to 3 plantlets per 
pot. After 10 to 16 weeks, mature seeds were harvested, chipped and analyzed for 
fatty acids as described in Examples 10 and 11. 
Media Recipes 

SB 196 - FN Lite liquid proliferation medium (per liter) - 

MS FeEDTA - 100x Stock 1                  10 ml 
MS Sulfate - 100x Stock 2                 10 ml 
FN Lite Halides - 100x Stock 3            10 ml 
FN Lite P,B,Mo - 100x Stock 4             10 ml 
B5 vitamins (1 ml/L) 
2,4-D (10 mg/L final concentration)       1.0 ml 
KNO3                                      2.83 gm 
(NH4)2SO4                                 0.463 gm 
Asparagine                                1.0 gm 
Sucrose (1%)                              10 gm 
pH 5.8 

FN Lite Stock Solutions 

Stock # 
1   MS Fe EDTA 100x Stock 
      Na2EDTA*            3.724 g     1.862 g 
      FeSO4-7H2O          2.784 g     1.392 g 
      * Add first, dissolve in dark bottle while stirring 

2   MS Sulfate 100x stock 
      MgSO4-7H2O          37.0 g      18.5 g 
      MnSO4-H2O           1.69 g      0.845 g 
      ZnSO4-7H2O          0.86 g      0.43 g 
      CuSO4-5H2O          0.0025 g    0.00125 g 

3   FN Lite Halides 100x Stock 
      CaCl2-2H2O          30.0 g      15.0 g 
      KI                  0.083 g     0.0715 g 
      CoCl2-6H2O          0.0025 g    0.00125 g 

4   FN Lite P,B,Mo 100x Stock 
      KH2PO4              18.5 g      9.25 g 
      H3BO3               0.62 g      0.31 g 
      Na2MoO4-2H2O        0.025 g     0.0125 g 



SB1 solid medium (per liter) - 

1 pkg. MS salts (Gibco/BRL - Cat# 11117-066) 
1 ml B5 vitamins 1000X stock 
31.5 g sucrose 
2 ml 2,4-D (20 mg/L final concentration) 
pH 5.7 
8 g TC agar 

SB 166 solid medium (per liter) - 

1 pkg. MS salts (Gibco/BRL - Cat# 11117-066) 
1 ml B5 vitamins 1000X stock 
60 g maltose 
750 mg MgCl2 hexahydrate 
5 g activated charcoal 
pH 5.7 
2 g gelrite 

SB 103 solid medium (per liter) - 

1 pkg. MS salts (Gibco/BRL - Cat# 11117-066) 
1 ml B5 vitamins 1000X stock 
60 g maltose 
750 mg MgCl2 hexahydrate 
pH 5.7 
2 g gelrite 

SB 71-4 solid medium (per liter) - 

1 bottle Gamborg's B5 salts w/ sucrose (Gibco/BRL - Cat# 21153-036) 
pH 5.7 
5 g TC agar 

2,4-D stock 

- obtained premade from Phytotech cat# D 295 - concentration is 1 mg/ml 

B5 Vitamins Stock (per 100 ml) - store aliquots at -20C 

10 g myo-inositol 
100 mg nicotinic acid 
100 mg pyridoxine HCl 
1 g thiamine 

If the solution does not dissolve quickly enough, apply a low level of heat via 
the hot stir plate. 

Chlorsulfuron Stock 

- 1 mg/ml in 0.01 N Ammonium Hydroxide 

EXAMPLE 10 

Analysis of Somatic Soy Embryos containing various promoters driving M. alpina 
delta-6 desaturase 

Mature somatic soybean embryos are a good model for zygotic embryos. 
While in the globular embryo state in liquid culture, somatic soybean embryos 
contain very low amounts of triacylglycerol or storage proteins typical of maturing, 
zygotic soybean embryos. At this developmental stage, the ratio of total 
triacylglyceride to total polar lipid (phospholipids and glycolipid) is about 1:4, as is 
typical of zygotic soybean embryos at the developmental stage from which the 
somatic embryo culture was initiated. At the globular stage as well, the mRNAs for 
the prominent seed proteins, α'-subunit of β-conglycinin, kunitz trypsin inhibitor 3, 
and seed lectin are essentially absent. Upon transfer to hormone-free media to 
allow differentiation to the maturing somatic embryo state, triacylglycerol becomes 
the most abundant lipid class. As well, mRNAs for α'-subunit of β-conglycinin, 
kunitz trypsin inhibitor 3 and seed lectin become very abundant messages in the 
total mRNA population. On this basis the somatic soybean embryo system behaves 
very similarly to maturing zygotic soybean embryos in vivo, and is therefore a good 
and rapid model system for analyzing the phenotypic effects of modifying the 
expression of genes in the fatty acid biosynthesis pathway. Most importantly, the 
model system is also predictive of the fatty acid composition of seeds from plants 
derived from transgenic embryos. 

Transgenic somatic soybean embryos containing the M. alpina delta-6 
desaturase expression vectors described in Example 2 were prepared using the 
methods described in Example 9. Fatty acid methyl esters were prepared from 
single matured, somatic soy embryos by transesterification. Embryos were placed 
in a vial containing 50 μL of trimethylsulfonium hydroxide (TMSH) and 0.5 mL of 
hexane and were incubated for 30 minutes at room temperature while shaking. 
Fatty acid methyl esters (5 μL injected from hexane layer) were separated and 
quantified using a Hewlett-Packard 6890 Gas Chromatograph fitted with an 
Omegawax 320 fused silica capillary column (Supelco Inc., Cat#24152). The oven 
temperature was programmed to hold at 220°C for 2.7 min, increase to 240°C at 
20°C/min and then hold for an additional 2.3 min. Carrier gas was supplied by a 
Whatman hydrogen generator. Retention times were compared to those for methyl 
esters of standards commercially available (Nu-Chek Prep, Inc. catalog #U-99-A). 
The amount of GLA accumulated in embryo tissue was used as an indicator of the 
strength of each individual promoter. As indicated in Table 6, all of the promoters 
tested were capable of driving expression of the M. alpina delta-6 desaturase. 
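
The oven program described above implies a fixed run time per injection. As a small worked example, the sketch below adds the two holds to the ramp duration; it ignores cool-down and equilibration between injections, which the text does not specify, and the function name is hypothetical.

```python
def oven_program_minutes(initial_hold_min, start_c, end_c,
                         ramp_c_per_min, final_hold_min):
    """Total run time for a hold / linear ramp / hold oven program."""
    ramp_min = (end_c - start_c) / ramp_c_per_min
    return initial_hold_min + ramp_min + final_hold_min


# Hold 2.7 min at 220 C, ramp to 240 C at 20 C/min, hold 2.3 min.
run_time_min = oven_program_minutes(2.7, 220.0, 240.0, 20.0, 2.3)

if __name__ == "__main__":
    print(f"GC run time per sample: {run_time_min:.1f} min")
```

The ramp contributes 1 min, so each chromatogram takes about 6 min of oven time.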

TABLE 6 

GLA Accumulation in Soybean Somatic Embryos: M. alpina delta-6 desaturase 
gene linked to various promoters 

Promoter                          GLA (% fatty acid) 
Soy α'-subunit β-conglycinin      40+ 
Soy KTi 3                         40+ 
Soy Annexin                       40 
Soy Glycinin 1                    35 
Soy 2S albumin                    22 
Pea Legumin A1                    10 
Soy β'-subunit β-conglycinin      9 
Soy BD30                          8 
Pea Legumin A2                    3 


EXAMPLE 11 

Analysis of transgenic Somatic Soy Embryos and Seed Chips containing EPA 
Biosynthetic Genes 

Transgenic somatic soybean embryos containing the expression vector 
pKR275 (Example 7) and either pKR274 (Example 4) or pKKE2 (Example 5) were 
prepared using the methods described in Example 9. 

A portion of the somatic soy embryos from each line generated was 
harvested and analyzed for fatty acid composition by GC as described in Example 
10. Approximately 10 embryos were analyzed for each individual transformation 
event. Fatty acids were identified by comparison of retention times to those for 
authentic standards. In this way, 471 events were analyzed for pKR274/pKR275 
and 215 events were analyzed for pKKE2/pKR275. From the 471 lines analyzed for 
pKR274/pKR275, 10 were identified that produced EPA (average of 10 individual 
embryos) at a relative abundance greater than 7% of the total fatty acids. The best 
line analyzed averaged 9% EPA with the best embryo of this line having 13% EPA. 
From the 215 lines analyzed for pKKE2/pKR275, 11 lines were identified that 
produced EPA (average of 10 individual embryos) at a relative abundance greater 
than 9% of the total fatty acids. The best line analyzed averaged 13% EPA with the 
best embryo of this line having 16% EPA. The best EPA-producing events from each 
construct set are shown in Table 7. In Table 7, clones 3306-2-3 to 3324-1-3 are 
pKR274/pKR275 events and 3338-6-3 to 3338-6-24 are pKKE2 events. Fatty acids 
in Table 7 are defined as X:Y where X is the fatty acid chain length and Y is the 
number of double bonds. In addition, fatty acids from Table 7 are further defined as 
follows, where the number in parentheses corresponds to the position of the double 
bonds from the carboxyl end of the fatty acid: 18:1=18:1(9), 18:2=18:2(9,12), 
GLA=18:3(6,9,12), 18:3=18:3(9,12,15), STA=18:4(6,9,12,15), HGLA=20:3(8,11,14), 
ARA=20:4(5,8,11,14), ETA=20:4(8,11,14,17), EPA=20:5(5,8,11,14,17) and 
DPA=22:5(7,10,13,16,19). Fatty acids listed as "others" include: 20:0, 20:1(5), 
20:2(11,14), 20:3(5,11,14), 20:3(11,14,17), 20:4(5,11,14,17), and 22:0. For pKKE2 
events each of these fatty acids is present at a relative abundance of less than 1% of 
the total fatty acids. For pKR274/pKR275 each of these fatty acids is present at a 
relative abundance of less than 1% of total fatty acids except for events 3306-5-2, 
3319-6-1 and 3319-2-13, in which 20:3(11,14,17) and 20:4(5,11,14,17) are both in 
the range of 1.1 to 2.2% of total fatty acids. 
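
The X:Y(positions) shorthand defined above is regular enough to parse mechanically. The following sketch, with hypothetical function names, splits a shorthand string into chain length, number of double bonds and double-bond positions counted from the carboxyl end, and derives the omega (methyl-end) position of the last double bond, which separates the omega-3 and omega-6 families.

```python
import re

# chain:bonds, optionally followed by (p1,p2,...) counted from the carboxyl end
_SHORTHAND = re.compile(r"^(\d+):(\d+)(?:\(([\d,]+)\))?$")


def parse_fatty_acid(shorthand):
    """Return (chain_length, double_bonds, positions) for e.g. '20:5(5,8,11,14,17)'."""
    match = _SHORTHAND.match(shorthand.replace(" ", ""))
    if match is None:
        raise ValueError(f"not a valid shorthand: {shorthand!r}")
    chain_length = int(match.group(1))
    double_bonds = int(match.group(2))
    positions = ()
    if match.group(3):
        positions = tuple(int(p) for p in match.group(3).split(","))
        if len(positions) != double_bonds:
            raise ValueError("number of positions does not match double-bond count")
    return chain_length, double_bonds, positions


def omega_position(chain_length, positions):
    """Carbons from the methyl end to the nearest double bond (3 for omega-3)."""
    return chain_length - max(positions)


if __name__ == "__main__":
    for name, code in [("EPA", "20:5(5,8,11,14,17)"), ("ARA", "20:4(5,8,11,14)")]:
        chain, bonds, pos = parse_fatty_acid(code)
        print(name, chain, bonds, pos, "omega-%d" % omega_position(chain, pos))
```

Applied to the definitions in the text, EPA, ETA, STA and DPA come out as omega-3 fatty acids, while GLA, HGLA and ARA come out as omega-6.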
TABLE 7 

Fatty acid analyses of transgenic soybean somatic embryos producing C20 PUFAs 

Clone        16:0  18:0  18:1  18:2  GLA   18:3  STA   HGLA  ARA   ETA   EPA   DPA   Others 
3306-2-3     14.9  2.3   6.3   15.8  21.7  11.5  4.5   4.8   0.8   2.7   8.4   1.2   2 
[?]          14.2  4.4   11.7  19.4  4.6   20.8  1.5   1.5   0.2   1.6   7.7   4.2   5.3 
3319-3-1     18.2  [?]   [?]   19.1  15.6  14.5  3.4   1.8   1.3   0.6   8.4   0.6   1.2 
3319-6-1     11.1  3.7   16.6  12.9  10.7  12.1  3.3   5.0   0.8   2.8   9.3   2.0   4 
[?]          12.7  [?]   [?]   [?]   10.8  15.9  3.1   2.4   0.1   2.8   8.0   1.1   3.3 
3319-2-16    12.7  2.5   0.9   [?]   10.3  12.1  2.3   3.4   4.0   1.0   7.3   2.6   2.3 
3319-3-6     11.7  2.0   [?]   13.2  11.5  7.7   1.9   2.8   0.7   1.8   9.3   1.8   3.3 
3320-6-1     15.3  3.7   13.5  10.7  14.8  12.4  4.5   6.6   1.4   2.4   8.0   1.2   2.4 
3322-5-2     13.9  2.9   14.4  [?]   17.4  13.8  3.5   2.9   0.2   1.8   8.1   0.9   2.2 
3324-1-3     12.0  4.4   18.6  17.6  13.9  7.8   1.8   4.8   0.3   3.4   8.1   0.8   2.9 
3338-6-3     14.3  3.2   18.1  11.0  13.7  8.8   3.0   5.1   0.2   5.3   9.6   1.2   2.1 
3338-7-11    20.5  2.9   9.9   10.6  8.9   17.3  3.8   2.0   0.4   3.0   12.8  1.8   1.9 
3338-7-12    16.5  2.1   16.2  15.4  16.1  11.5  2.5   1.7   0.2   2.0   10.0  0.8   1.2 
3338-3-4     20.2  3.9   6.7   11.9  9.9   10.5  3.9   4.6   1.8   3.1   12.0  3.2   2.1 
3338-3-5     14.7  2.2   12.4  12.4  17.6  10.8  4.7   2.9   1.3   1.4   10.0  0.9   1.8 
3338-6-10    13.7  1.8   12.4  8.3   16.4  14.0  5.8   3.2   0.3   4.0   12.1  1.2   2.2 
3338-6-12    13.9  2.4   13.1  9.4   22.7  5.7   3.1   4.0   0.4   3.3   13.3  0.9   1.5 
3338-7-21    14.8  1.7   8.4   13.1  20.2  12.5  4.8   3.9   0.4   3.6   11.6  0.6   2 
3338-7-30    15.4  2.8   18.9  12.9  9.6   10.1  2.4   2.3   0.5   2.3   13.0  2.6   2.4 
3338-1-4     14.1  2.1   10.6  26.3  13.8  9.6   1.9   3.3   1.1   1.9   10.1  1.0   1.3 
3338-6-24    25.1  4.5   13.3  4.0   15.5  3.1   2.6   5.3   0.7   4.0   13.0  0.9   1.7 

[?] = entry illegible in the source copy. 


Mature plants were regenerated from the highest EPA-producing embryos as 
described in Example 9, and the fatty acid analyses were performed on chips of 
the seeds from the regenerated plants. The results for six seeds from three plants 
are presented in Table 8. Seeds of control plants possessed fatty acid profiles typical 
of normal soybean, in which linolenic acid (18:3) was the most highly unsaturated 
fatty acid that was detectable. Seeds produced from plants that had a reconstituted 
pathway for C20 PUFAs had as much as 25% of their total fatty acid in the form of 
C20 material. Combined levels of EPA and DPA were frequently greater than 15%, 
and were as high as 23.5% of the total. 
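
The combined EPA+DPA figure is simply the sum of the two reported percentages for each seed chip. As a minimal consistency check, the sketch below recomputes it for the first seed listed for each of the three plants in Table 8; the values are copied from the table, while the data structure and function name are hypothetical.

```python
# (plant ID, EPA %, DPA %, reported EPA+DPA %) for the first seed of each plant
SEEDS = [
    ("3338-3-4-7", 18.8, 4.1, 22.9),
    ("3338-7-11-11", 16.4, 4.3, 20.7),
    ("3339-5-3-7", 19.5, 4.0, 23.5),
]


def combined_epa_dpa(epa_pct, dpa_pct):
    """Sum of EPA and DPA percentages, rounded to one decimal like the table."""
    return round(epa_pct + dpa_pct, 1)


if __name__ == "__main__":
    for plant, epa, dpa, reported in SEEDS:
        total = combined_epa_dpa(epa, dpa)
        print(f"{plant}: EPA+DPA = {total} (reported {reported})")
    print("maximum:", max(combined_epa_dpa(e, d) for _, e, d, _ in SEEDS))
```

For these three seeds the recomputed sums agree with the reported column, and the largest of them is the 23.5% quoted in the text.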



TABLE 8 

Event          16:0  18:0  18:1  18:2  GLA   18:3  STA   HGLA  ARA   ETA   EPA   DPA   Other  EPA+DPA 
3338-3-4-7     14.4  8.5   19.7  9.1   9.1   3.1   1.2   6.6   1.0   2.4   18.8  4.1   2.0    22.9 
               13.2  5.5   18.6  10.4  11.7  3.3   1.1   10.1  2.2   2.4   19.6  0.8   1.2    20.4 
               15.6  9.0   13.9  16.6  6.6   7.1   0.0   3.9   0.0   1.8   15.5  4.2   5.8    19.7 
               22.4  8.8   20.8  14.2  5.0   3.8   0.6   3.0   1.0   1.1   14.0  3.1   2.2    17.1 
               13.2  7.5   27.0  12.8  9.0   2.8   0.9   5.7   1.8   1.2   11.2  4.0   2.9    15.2 
               15.2  4.9   18.3  12.3  13.3  3.5   1.3   10.5  5.3   2.4   12.9  0.0   0.0    12.9 
3338-7-11-11   13.0  7.1   13.6  13.1  13.0  5.9   1.7   5.2   0.5   0.4   16.4  4.3   5.8    20.7 
               12.9  7.3   13.1  14.9  9.6   7.2   1.7   5.9   0.8   0.6   14.3  4.7   7.0    18.9 
               12.4  7.6   15.9  12.6  13.6  5.4   1.8   6.0   0.5   0.0   15.2  3.7   5.2    18.9 
               15.0  5.9   18.4  16.0  10.2  8.4   1.7   4.0   0.6   0.0   13.9  2.4   3.5    16.3 
               13.8  5.9   19.6  18.0  7.2   10.8  1.5   3.4   0.4   0.0   10.8  3.2   5.5    14.0 
               16.2  6.2   15.2  22.4  6.9   9.2   1.1   3.4   0.8   0.0   11.7  2.2   4.6    13.9 
3339-5-3-7     13.7  8.1   6.9   8.1   16.5  4.7   1.8   7.1   0.7   2.2   19.5  4.0   6.7    23.5 
               15.4  6.9   11.8  16.4  10.0  4.3   [?]   [?]   [?]   [?]   16.3  3.5   7.3    19.8 
               14.7  6.3   13.6  18.1  8.1   3.1   0.9   4.3   2.1   0.1   14.9  4.2   9.6    19.1 
               12.3  6.5   20.9  13.1  15.1  3.0   1.0   6.1   1.2   1.4   10.6  1.4   7.3    12.1 
               12.2  6.4   22.9  13.7  12.0  2.9   0.9   5.7   1.3   1.3   9.9   1.7   9.1    11.7 
               13.5  7.2   22.9  11.8  8.9   3.6   0.8   6.5   2.2   1.7   9.6   1.6   9.8    11.2 
Control        17.3  4.3   13.4  51.6  0.0   12.9  0.0   0.0   0.0   0.0   0.0   0.0   0.0    0.0 
Control        [?]   4.8   12.1  50.5  0.0   14.5  0.0   0.0   0.0   0.0   0.0   0.0   0.0    0.0 

[?] = entry illegible in the source copy. 

EXAMPLE 12 

Isolation of a Novel Elongase Gene from the 
Algae Pavlova sp. (CCMP459) 
The fatty acid composition of the algae Pavlova sp. (CCMP 459) (Pav459) 
was investigated to determine the polyunsaturated fatty acids (PUFAs) produced by 
this organism. This algae showed a substantial amount of long chain PUFAs 
including eicosapentaenoic acid (EPA, 20:5n-3) and docosahexaenoic acid (DHA, 
22:6n-3). Thus, Pav459 was predicted to possess an elongase capable of 
converting EPA to ω3-docosapentaenoic acid (DPA, 22:5n-3), which a delta-4 
desaturase can convert to DHA. The goal was therefore to isolate the predicted 
elongase gene from Pav459, and to verify the functionality of the enzyme by 
expression in an alternate host. 

Frozen pellets of Pav459 were obtained from Provasoli-Guillard National 
Center for Culture of Marine Phytoplankton (CCMP, West Boothbay Harbor, ME). 
These pellets were crushed in liquid nitrogen and total RNA was extracted from 
Pav459 by using the Qiagen RNeasy Maxi Kit (Qiagen, Valencia, CA), per 
manufacturer's instructions. From this total RNA, mRNA was isolated using oligo dT 
cellulose resin, which was then used for the construction of a cDNA library using the 
pSport 1 vector (Invitrogen, Carlsbad, CA). The cDNA thus produced was 
directionally cloned (5' SalI/3' NotI) into pSport 1 vector. The Pav459 library contained 
approximately 6.1 x 10^5 clones per ml, each with an average insert size of 
approximately 1200 bp. Two thousand five hundred primary clones from this library 
were sequenced from the 5' end using the T7 promoter primer (SEQ ID NO:93). 

TAATACGACTCACTATTAGG SEQ ID NO:93 

Sequencing was carried out using the ABI BigDye sequencing kit (Applied 
Biosystems, CA) and the MegaBase Capillary DNA sequencer (Amersham 
Biosciences, Piscataway, NJ). Two clones, designated 'pav06-C06' and 'pav07- 
G01', which aligned to give a 500 bp sequence containing the 5' end of this novel 
elongase, were obtained from sequencing of the 2,500 library clones. This fragment 
shared 33.3% amino acid sequence identity with the mouse elongase MELO4 and 
32.7% amino acid sequence identity with T. aureum elongase TELO1 (WO 
02/08401). To isolate the full-length gene, the EST clone pav06-C06 was used as a 
template for PCR reaction with 10 pmol of the 5' primer R01327 (SEQ ID NO:94) 
and 10 pmol vector primer R0898 (SEQ ID NO:83). 

TGCCCATGATGTTGGCCGCAGGCTATCTTCTAGTG SEQ ID NO:94 

PCR amplification was carried out using Platinum Taq DNA polymerase 
(Invitrogen, Carlsbad, CA) in a 50 μl total volume containing: 1 μl of the cDNA clone 
pav06-C06, PCR buffer containing 20 mM Tris-Cl, pH 8.4, 50 mM KCl (final 
concentration), 200 μM each deoxyribonucleotide triphosphate, 10 pmole of each 
primer, 1.5 mM MgSO4, and 0.5 μl of Platinum Taq (HF) DNA polymerase. 
Amplification was carried out as follows using the Perkin Elmer 9700 machine: initial 
denaturation at 94°C for 3 minutes, followed by 35 cycles of the following: 94°C for 
45 sec, 55°C for 30 sec, 68°C for 2 min. The reaction was terminated at 4°C. The 
PCR amplified mixture was run on a gel, an amplified fragment of approximately 
1.3 Kb was gel purified, and the isolated fragment was cloned into the pCR-blunt 
vector (Invitrogen, Carlsbad, CA). The recombinant plasmid was transformed into 
TOP10 supercompetent cells (Invitrogen, Carlsbad, CA), and prepared. The 
prepared recombinant plasmid was digested with EcoRI, run on a gel, and the 
digested fragment of approximately 1.2 Kb was gel purified, and cloned into pYX242 
(EcoRI) vector (Novagen, Madison, WI). The new plasmid was designated as 
pRPL-6-1. 
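
For planning purposes, the programmed time of the thermocycling profile above can be estimated from the step durations alone. The sketch below does this; it deliberately ignores temperature ramping between steps and the final 4°C hold, so the real run is somewhat longer, and the function name is hypothetical.

```python
def pcr_program_minutes(initial_denaturation_s, cycles, step_seconds):
    """Programmed time in minutes: one initial step plus `cycles` repeats."""
    total_seconds = initial_denaturation_s + cycles * sum(step_seconds)
    return total_seconds / 60.0


# 3 min initial denaturation, then 35 cycles of 45 s / 30 s / 2 min.
program_min = pcr_program_minutes(180, 35, (45, 30, 120))

if __name__ == "__main__":
    print(f"programmed time: {program_min:.2f} min")
```

On these numbers the extension step dominates: 35 x 2 min accounts for 70 of the roughly 117 programmed minutes.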

The plasmid pRPL-6-1 was prepared and sequenced using ABI 373A Stretch 
DNA Sequencer (Perkin Elmer, Foster City, CA). The translated amino acid 
sequence of the cDNA in pRPL-6-1 had 33.7% identity in 261 amino acids with 
MELO4, 33.8% identity in 240 amino acids with GLELO, 28.1% identity in 
274 amino acids with HSELO1, and 32.5% identity in 246 amino acids with TELO1 
(WO 02/08401). 
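
Each identity figure above is a percentage over a stated alignment length, so the approximate number of identical residues can be recovered by multiplication. The sketch below does this for the four comparisons in the order they are listed; the rounding to whole residues is an assumption, since the underlying alignments are not given, and the function name is hypothetical.

```python
# (percent identity, aligned length in amino acids), in the order listed above
COMPARISONS = [
    (33.7, 261),
    (33.8, 240),
    (28.1, 274),
    (32.5, 246),
]


def identical_residues(percent_identity, aligned_length):
    """Approximate count of identical positions implied by a percent identity."""
    return round(percent_identity / 100.0 * aligned_length)


if __name__ == "__main__":
    for pct, length in COMPARISONS:
        n = identical_residues(pct, length)
        print(f"{pct}% of {length} aa is about {n} identical residues")
```

All four comparisons correspond to roughly 77 to 88 identical positions, so the new sequence is about equally distant from each of the reference elongases.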

The construct pRPL-6-1 was transformed into S. cerevisiae 334 (Hoveland et 
al. (1989) Gene 83:57-64) and screened for elongase activity. S. cerevisiae 334 
containing the unaltered pYX242 vector was used as a negative control. The 
cultures were grown for 44 hours at 24°C, in selective media (Ausubel et al., (1992) 
Short Protocols in Molecular Biology, Ch. 13, p. 3-5), in the presence of 25 μM of 
GLA or EPA. In this study, DGLA or ω3-docosapentaenoic acid (DPA, 22:5n-3), 
respectively, was the predicted product of the elongase activity. The lipid profiles of 
these yeast cultures indicated that while no conversion of GLA to DGLA was seen, 
EPA was elongated to DPA at a very low level (DPA was 0.34% of total fatty acids, 
while EPA was 32.28% of total fatty acids). This indicated that the expressed 
enzyme in this culture preferred the elongation of the 20 carbon chain long PUFA, 
and not the 18 carbon chain long PUFA GLA. It also indicated that a mutation might 
be present in the DNA sequence, which is inhibiting the full activity of the expressed 
enzyme. 

To isolate the full-length gene without mutations, RACE (rapid amplification of 
cDNA ends) ready cDNA was used as a target for the reaction. To prepare this 
material, approximately 5 μg of total RNA was used according to the manufacturer's 
direction with the GeneRacer™ kit (Invitrogen, Carlsbad, CA) and Superscript II™ 
enzyme (Invitrogen, Carlsbad, CA) for reverse transcription to produce cDNA target. 
This cDNA was then used as a template for a PCR reaction with 50 pmols of the 5' 
primer R01327 and 30 pmol GeneRacer™ 3' primer (SEQ ID NO:95). 



GCTGTCAACGATACGCTACGTAACG (SEQ ID NO:95)



PCR amplification was carried out using Platinum Taq DNA polymerase
(Invitrogen, Carlsbad, CA) in a 50 µl total volume containing: 2 µl of the RACE
ready cDNA, PCR buffer containing 20 mM Tris-Cl, pH 8.4, 50 mM KCl (final
concentration), 200 µM each deoxyribonucleotide triphosphate, 10 pmole of each
primer, 1.5 mM MgSO4, and 0.5 µl of Platinum Taq (HF) DNA polymerase.
Amplification was carried out as follows using the Perkin Elmer 9600 machine: initial
denaturation at 94°C for 3 minutes, followed by 35 cycles of the following: 94°C for
45 sec, 55°C for 30 sec, 68°C for 2 min. The reaction was terminated at 4°C.
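
The cycling program lends itself to a quick run-time estimate. The sketch below encodes the hold times given above (3 minutes at 94°C, then 35 cycles of 45 sec, 30 sec and 2 min) and sums them. The class and function names are illustrative; ramp times between temperatures and the final 4°C hold are not specified in the text and are left out, so the figure is a lower bound on the actual run time.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float  # block temperature in degrees Celsius
    seconds: int   # programmed hold time


# Values taken from the amplification protocol in the text.
INITIAL_DENATURATION = Step(94, 3 * 60)
CYCLE = (Step(94, 45), Step(55, 30), Step(68, 2 * 60))
N_CYCLES = 35


def programmed_hold_seconds(initial: Step, cycle: tuple, n_cycles: int) -> int:
    """Sum of programmed hold times, ignoring ramping and the final 4 C hold."""
    return initial.seconds + n_cycles * sum(step.seconds for step in cycle)


total = programmed_hold_seconds(INITIAL_DENATURATION, CYCLE, N_CYCLES)
print(f"{total} s = {total / 60:.2f} min of programmed hold time")
```

With these values the programmed hold time is 180 + 35 × 195 = 7005 seconds, a little under two hours.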

The PCR amplified mixture was run on a gel, an amplified fragment of
approximately 1.2 Kb was gel purified, and the isolated fragment was cloned into the
pCR-blunt vector (Invitrogen, Carlsbad, CA). The recombinant plasmids were
transformed into TOP10 supercompetent cells (Invitrogen, Carlsbad, CA), and
prepared. The prepared recombinant plasmid was digested with EcoRI, run on a
gel, and the digested fragment of approximately 1.2 Kb was gel purified, and cloned
into pYX242 (EcoRI) vector (Novagen, Madison, WI). The new plasmids were
designated as pRPL-6-B2 and pRPL-6-A3.

The plasmids pRPL-6-B2 and pRPL-6-A3 were prepared and sequenced
using ABI 373A Stretch DNA Sequencer (Perkin Elmer, Foster City, CA). The
translated amino acid sequence of the cDNA in pRPL-6-B2 had 34.1% identity in
261 amino acids with MELO4, 33.8% identity in 240 amino acids with GLELO,
28.5% identity in 274 amino acids with HSELO1, and 32.5% identity in 246 amino
acids with TELO1. (Plasmid pRPL-6-B2 was deposited with the American Type
Culture Collection, 10801 Manassas, VA 20110-2209 under the terms of the
Budapest Treaty and was accorded accession number PTA-4350.)

The constructs pRPL-6-B2 and pRPL-6-A3 were transformed into
S. cerevisiae 334 (Hoveland et al., supra) and screened for elongase activity.
S. cerevisiae 334 containing the unaltered pYX242 vector was used as a negative
control. The cultures were grown for 44 hours at 24°C, in selective media (Ausubel
et al., supra), in the presence of 25 µM of GLA or EPA. In this study, DGLA or
ω3-docosapentaenoic acid (DPA, 22:5n-3), respectively, was the predicted product
of the elongase activity. The lipid profiles of these yeast cultures indicated that GLA
was not elongated to DGLA in any of the samples (data not shown). The cultures of
334(pRPL-6-B2) and 334(pRPL-6-A3) had significant levels of conversion of the
substrate EPA to DPA, indicating that the expressed enzymes in these cultures
preferred the elongation of 20-carbon chain long PUFA, and not the 18-carbon chain
long PUFA, GLA.

The amino acid sequences of the 3 clones were compared to determine if the
substrate conversion levels were dictated by the translated sequences. The cDNA
sequence of pRPL-6-1 is different from pRPL-6-B2 at A512G. This single mutation
substantially reduced the conversion of the C20 substrate fatty acid to its elongated
product. It appears that this is an important region of the enzyme for 20-carbon
chain elongation. The cDNA sequence of pRPL-6-A3 is different from pRPL-6-B2 at
D169N and C745R. These mutations reduced the conversion of the C20 substrate
fatty acid to its elongated product, but the expressed enzyme was able to maintain
some activity. The elongase gene in pRPL-6-B2 has the sequence set forth in SEQ
ID NO:49 and the amino acid sequence set forth in SEQ ID NO:50.
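
Labels such as A512G, D169N and C745R follow the common "original residue, position, replacement residue" shorthand. The sketch below shows one way such labels could be parsed and applied to a reference sequence for comparison of clones. It is an illustration only: the names are hypothetical, the toy sequence is not from the application, and the text does not settle whether each label counts nucleotides or amino acids, so the code makes no assumption about the alphabet.

```python
import re
from typing import NamedTuple


class Substitution(NamedTuple):
    original: str     # residue in the reference sequence
    position: int     # 1-based position in the reference sequence
    replacement: str  # residue found in the variant clone


_LABEL = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_substitution(label: str) -> Substitution:
    """Parse a label such as 'A512G' into its three parts."""
    match = _LABEL.match(label.strip())
    if match is None:
        raise ValueError(f"not a substitution label: {label!r}")
    return Substitution(match.group(1), int(match.group(2)), match.group(3))


def apply_substitution(reference: str, sub: Substitution) -> str:
    """Return the reference with the substitution applied, checking the original residue."""
    index = sub.position - 1
    if not 0 <= index < len(reference):
        raise ValueError(f"position {sub.position} is outside the sequence")
    if reference[index] != sub.original:
        raise ValueError(
            f"expected {sub.original} at position {sub.position}, found {reference[index]}"
        )
    return reference[:index] + sub.replacement + reference[index + 1:]


# Labels quoted in the text.
for label in ("A512G", "D169N", "C745R"):
    print(parse_substitution(label))

# Hypothetical four-residue reference, used only to show the mechanics.
print(apply_substitution("ACGT", parse_substitution("A1G")))
```

Checking the original residue before substituting guards against applying a label to the wrong sequence or the wrong numbering scheme.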

To further confirm the substrate specificity of the algal elongation enzyme,
described above and referred to herein as PELO1p, the recombinant yeast strain
334(pRPL-6-B2) was grown in minimal media containing n-6 fatty acids LA, GLA,
DGLA, AA, or n-3 fatty acids ALA, STA, ETA, EPA, or 20:0, or 20:1. The lipid
profiles of these yeast cultures, when examined by GC and GC-MS, indicated that
there were accumulations of adrenic acid (ADA, 22:4n-6) and EPA, respectively. The
levels of these fatty acids were 1.40% ADA and 2.54% EPA, respectively, of the
total fatty acids in the strains containing the PELO1 sequence. These represented
14.0% and 14.1% conversions of the substrate fatty acids, respectively, to the
products elongated by two carbon atoms. No elongation of the saturated fatty acid
20:0 or monounsaturated fatty acid 20:1 was seen. Also, no elongation of the C18
substrates LA, GLA, ALA, or STA was seen. These results indicated that the
expressed enzyme activity in strain 334(pRPL-6-B2) was specific for the elongation
of 20-carbon chain long PUFAs, and not the 18-carbon chain long PUFA, or the
20-carbon chain long saturated or monounsaturated fatty acids.
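
The conversion percentages can be related to the measured fatty acid levels with a short calculation. The application does not state the formula it used; the sketch below assumes the common convention of product divided by the sum of product and remaining substrate, and both function names are illustrative.

```python
def percent_conversion(product_pct: float, substrate_pct: float) -> float:
    """Conversion = product / (product + remaining substrate), as a percentage.

    Assumed convention; the text does not give its own formula.
    """
    total = product_pct + substrate_pct
    if total <= 0:
        raise ValueError("product and substrate levels must sum to a positive value")
    return 100.0 * product_pct / total


def implied_substrate_pct(product_pct: float, conversion_pct: float) -> float:
    """Remaining substrate level implied by a product level and a conversion figure."""
    if not 0 < conversion_pct <= 100:
        raise ValueError("conversion must be in (0, 100]")
    return product_pct * (100.0 / conversion_pct - 1.0)


# pRPL-6-1 culture: DPA 0.34% and EPA 32.28% of total fatty acids (values from the text).
print(round(percent_conversion(0.34, 32.28), 2))

# pRPL-6-B2 cultures: product levels and conversions quoted in the text.
print(round(implied_substrate_pct(1.40, 14.0), 2))
print(round(implied_substrate_pct(2.54, 14.1), 2))
```

Under this assumed convention, the pRPL-6-1 culture corresponds to roughly 1% conversion of EPA, which is consistent with the "very low level" described for that clone.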

EXAMPLE 13

Assembling DHA Biosynthetic Pathway Genes for Expression in Somatic Soybean
Embryos (pKR365, pKR364 and pKR357)

Construction of plasmid pKR365

The S. diclina delta-6 desaturase, M. alpina delta-5 desaturase and S. diclina
delta-17 desaturase were cloned into plasmid pKR365 behind strong, seed-specific
promoters allowing for high expression of these genes in somatic soybean embryos
and soybean seeds. The delta-6 desaturase was cloned behind the KTi promoter
followed by the KTi 3' termination region (Kti/Sdd6/Kti3' cassette). The delta-5
desaturase was cloned behind the Glycinin Gy1 promoter followed by the pea
leguminA2 3' termination region (Gy1/Mad5/legA2 cassette). The S. diclina delta-17
desaturase was cloned behind the soybean Annexin promoter followed by the soy
BD30 3' termination region (Ann/Sdd17/BD30 cassette). Plasmid pKR365 also
contains the T7prom/HPT/T7term cassette for bacterial selection of the plasmid on
hygromycin B and a bacterial origin of replication (ori) from the vector pSP72
(Stratagene).

Plasmid pKR365 was constructed from a number of different intermediate
cloning vectors as follows: The Gy1/Mad5/legA2 cassette was released from
plasmid pKR287 by digestion with SbfI and BsiWI. This cassette was cloned into
the SbfI/BsiWI site of plasmid pKR359, containing the Kti/Sdd6/Kti3' cassette, the
T7prom/hpt/T7term cassette and the bacterial ori to give pKR362. The
Ann/Sdd17/BD30 cassette, released from pKR271 (described in Example 7) by
digestion with PstI, was then cloned into the SbfI site of pKR362 to give pKR365.
A schematic representation of pKR365 is shown in Figure 6. A detailed description
for plasmid construction for pKR287 and pKR359 is provided below.
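
Cloning a PstI fragment into an SbfI site, as above, works because the two enzymes leave the same single-stranded overhang. The sketch below derives overhangs from recognition sequences and top-strand cut positions. The site data are the standard published values for these enzymes and do not come from the application; the names and data layout are illustrative.

```python
from typing import NamedTuple, Tuple


class Enzyme(NamedTuple):
    site: str  # recognition sequence, 5'->3', top strand
    cut: int   # index within the site after which the top strand is cut


# Standard published recognition sites (assumption: not stated in the text).
ENZYMES = {
    "PstI": Enzyme("CTGCAG", 5),     # CTGCA^G
    "SbfI": Enzyme("CCTGCAGG", 6),   # CCTGCA^GG
    "EcoRI": Enzyme("GAATTC", 1),    # G^AATTC
    "MfeI": Enzyme("CAATTG", 1),     # C^AATTG
    "EcoRV": Enzyme("GATATC", 3),    # GAT^ATC
    "BsiWI": Enzyme("CGTACG", 1),    # C^GTACG
    "NotI": Enzyme("GCGGCCGC", 2),   # GC^GGCCGC
}


def overhang(name: str) -> Tuple[str, str]:
    """Return (end type, overhang sequence) for a palindromic site."""
    site, cut = ENZYMES[name]
    mirror = len(site) - cut  # cut position on the bottom strand, by symmetry
    if cut == mirror:
        return ("blunt", "")
    kind = "5'" if cut < mirror else "3'"
    return (kind, site[min(cut, mirror):max(cut, mirror)])


def compatible(a: str, b: str) -> bool:
    """True when both enzymes leave the same kind of end with the same overhang."""
    return overhang(a) == overhang(b)


for pair in (("PstI", "SbfI"), ("EcoRI", "MfeI"), ("EcoRI", "PstI")):
    print(pair, overhang(pair[0]), overhang(pair[1]), compatible(*pair))
```

The same check covers the EcoRI fragments that are cloned into MfeI sites elsewhere in this Example, since both enzymes leave a 5' AATT overhang.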

Plasmid pKR287 was constructed by digesting pKR136 (described in
Example 4) with NotI, to release the M. alpina delta-5 desaturase, and cloning this
fragment into the NotI site of pKR263 (described in Example 4).

Plasmid pKR359 was constructed by cloning the NotI fragment of pKR295,
containing the delta-6 desaturase, into the NotI site of the Kti/NotI/Kti3' cassette in
pKR353. Vector pKR353 was constructed by cloning the HindIII fragment,
containing the Kti/NotI/Kti3' cassette, from pKR124 (described in Example 2) into
the HindIII site of pKR277. Plasmid pKR277 was constructed by digesting pKR197
(described in Example 4) with HindIII to remove the βcon/NotI/Phas3' cassette. To
construct pKR295, the gene for the S. diclina delta-6 desaturase was removed from
pRSP1 (Table 1) by digestion with EcoRI and EcoRV and cloned into the
MfeI/EcoRV site of pKR288. Vector pKR288 was an intermediate cloning vector
containing a DNA stuffer fragment flanked by NotI/MfeI sites at the 5' end and
EcoRV/NotI sites at the 3' end of the fragment. The DNA stuffer fragment was
amplified with Vent polymerase (NEB) from plasmid CalFad2-2 (described in
WO 01/12800) using primer oCal-26 (SEQ ID NO:96), designed to introduce an MfeI
site at the 5' end of the fragment, and oCal-27 (SEQ ID NO:97), designed to
introduce an EcoRV site at the 3' end of the fragment.

GCCAATTGGAGCGAGTTCCAATCTC (SEQ ID NO:96)

GCGATATCCGTTTCTTCTGACCTTCATC (SEQ ID NO:97)

The primers also introduced partial NotI sites at both ends of the fragment such that
subsequent cloning into a filled NotI site added NotI sites to the end.
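
The sites carried by the two primers can be confirmed by a simple string search. The sketch below scans SEQ ID NO:96 and SEQ ID NO:97 as given above for the MfeI, EcoRV and NotI recognition sequences. The recognition sequences are the standard published ones rather than values stated in the application, and the helper name is illustrative.

```python
# Standard published recognition sequences (assumption: not stated in the text).
SITES = {
    "MfeI": "CAATTG",
    "EcoRV": "GATATC",
    "NotI": "GCGGCCGC",
}

# Primer sequences as listed in the text.
PRIMERS = {
    "oCal-26 (SEQ ID NO:96)": "GCCAATTGGAGCGAGTTCCAATCTC",
    "oCal-27 (SEQ ID NO:97)": "GCGATATCCGTTTCTTCTGACCTTCATC",
}


def find_sites(sequence: str, sites: dict) -> dict:
    """Map each enzyme name to the 0-based start positions of its site in the sequence."""
    hits = {}
    for name, site in sites.items():
        positions = []
        start = sequence.find(site)
        while start != -1:
            positions.append(start)
            start = sequence.find(site, start + 1)
        if positions:
            hits[name] = positions
    return hits


for primer_name, primer_seq in PRIMERS.items():
    print(primer_name, find_sites(primer_seq, SITES))
```

Only the written strand is searched, which is sufficient here because all three sites are palindromic. Neither primer contains a complete NotI site; both begin with GC, which would complete GCGGCCGC when joined to a filled-in NotI end (GCGGCC), in line with the "partial NotI sites" described above.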

Construction of plasmid pKR364

The M. alpina delta-6 desaturase, M. alpina delta-5 desaturase and S. diclina
delta-17 desaturase were cloned into plasmid pKR364 behind strong, seed-specific
promoters allowing for high expression of these genes in somatic soybean embryos
and soybean seeds. Plasmid pKR364 is identical to pKR365 except that the NotI
fragment that contains the S. diclina delta-6 desaturase in pKR365 was replaced
with the NotI fragment containing the M. alpina delta-6 desaturase as found in
pKR274. A schematic representation of pKR364 is shown in Figure 7.

Construction of plasmid pKR357

The S. aggregatum delta-4 desaturase, M. alpina elongase and Pavlova
elongase (Table 1) were cloned into plasmid pKR357 behind strong, seed-specific
promoters allowing for high expression of these genes in somatic soybean embryos
and soybean seeds. The delta-4 desaturase (SEQ ID NO:51, and its protein
translation product shown in SEQ ID NO:52) was cloned behind the KTi promoter
followed by the KTi 3' termination region (Kti/Sad4/Kti3' cassette). The Pavlova
elongase (SEQ ID NO:49) was cloned behind the Glycinin Gy1 promoter followed by
the pea leguminA2 3' termination region (Gy1/Pavelo/legA2 cassette). The M.
alpina elongase was cloned behind the promoter for the α'-subunit of β-conglycinin
followed by the 3' transcription termination region of the phaseolin gene
(βcon/Maelo/Phas3' cassette). Plasmid pKR357 also contains the
T7prom/HPT/T7term cassette for bacterial selection of the plasmid on hygromycin
B, a 35S/hpt/NOS3' cassette for selection in soy and a bacterial origin of replication
(ori).

Plasmid pKR357 was constructed from a number of different intermediate
cloning vectors as follows: The Gy1/Pavelo/legA2 cassette was released from
plasmid pKR336 by digestion with PstI and BsiWI. The Gy1/Pavelo/legA2 cassette
was then cloned into the SbfI/BsiWI site of plasmid pKR324, containing the
βcon/Maelo/Phas3' cassette, the T7prom/hpt/T7term cassette, the 35S/hpt/Nos3'
cassette and the bacterial ori to give pKR342. The KTi/Sad4/KTi3' cassette,
released from pKR348 by digestion with PstI, was then cloned into the SbfI site of
pKR342 to give pKR357. A schematic representation of pKR357 is shown in
Figure 8. A detailed description for plasmid construction for pKR336, pKR324 and
pKR348 is provided below.

Plasmid pKR336 was constructed by digesting pKR335 with NotI, to release
the Pavlova elongase, and cloning this fragment into the NotI site of pKR263
(described in Example 4), which contained the Gy1/NotI/legA2 cassette. To
construct pKR335, pRPL-6-B2 (described in Table 1) was digested with PstI and the
3' overhang removed by treatment with VENT polymerase (NEB). The plasmid was
then digested with EcoRI to fully release the Pavlova elongase as an EcoRI/PstI
blunt fragment. This fragment was cloned into the MfeI/EcoRV site of intermediate
cloning vector pKR333 to give pKR335. Vector pKR333 was identical to pKR288
(Example 3 and 13) in that it contained the same MfeI and EcoRV sites flanked by
NotI sites and was generated in a similar way as pKR288.

Plasmid pKR324 was constructed by cloning the NotI fragment of pKS134
(described in Example 3), containing the M. alpina elongase, into the NotI site of the
βcon/NotI/Phas3' cassette of vector pKR72 (described in Example 4).

Plasmid pKR348 was constructed by cloning the NotI fragment of pKR300,
containing the S. aggregatum delta-4 desaturase, into the NotI site of the
KTi/NotI/KTi3' cassette in pKR123R. To construct pKR300, the gene for the delta-4
desaturase was removed from pRSA1 (Table 1) by digestion with EcoRI and EcoRV
and cloned into the MfeI/EcoRV site of pKR288 (described in Example 3 and 13).
Plasmid pKR123R contains a NotI site flanked by the KTi promoter and the KTi
transcription termination region (KTi/NotI/KTi3' cassette). In addition, the
KTi/NotI/KTi3' cassette was flanked by PstI sites. The KTi/NotI/KTi3' cassette was
amplified from pKS126 (described in Example 2) using primers oKTi5 (SEQ ID
NO:23) and oKTi7 (SEQ ID NO:98) designed to introduce an XbaI and BsiWI site at
the 5' end, and a PstI/SbfI and XbaI site at the 3' end, of the cassette.

TTCTAGACCTGCAGGATATAATGAGCCG (SEQ ID NO:98)

The resulting PCR fragment was subcloned into the XbaI site of the cloning
vector pUC19 to give plasmid pKR123R with the KTi/NotI/KTi3' cassette flanked by
PstI sites.

Production of DHA in somatic embryos

Plasmids pKR357, pKR365 and pKR364 were prepared as described in
Example 9. Fragments of pKR365 and pKR364 were also obtained and purified as
described for pKR274, pKR275 and pKKE2 in Example 9. Plasmids pKR357 and
either pKR365 or pKR364 were cotransformed into soybean embryogenic
suspension cultures (cv. Jack) as described in Example 9. Hygromycin-resistant
embryos containing pKR365 and pKR357, or pKR364 and pKR357 were selected
and clonally propagated also as described in Example 9. Embryos were matured by
culture for 4-6 weeks at 26°C in SB196 under cool white fluorescent (Phillips cool
white Econowatt F40/CW/RS/EW) and Agro (Phillips F40 Agro) bulbs (40 watt) on a
16:8 hr photoperiod with light intensity of 90-120 µE/m2s. After this time embryo
clusters were removed to a solid agar media, SB166, for 1-2 weeks. Clusters were
then subcultured to medium SB103 for 3 weeks. During this period, individual
embryos were removed from the clusters and screened for alterations in their fatty
acid compositions as follows.

Fatty acid methyl esters were prepared from single, matured, somatic soy
embryos by transesterification as described in Example 10. Retention times were
compared to those for methyl esters of standards commercially available (Nu-Chek
Prep, Inc. catalog #U-99-A). Six embryos from each event were analyzed in this
way. Fatty acid methyl esters from embryos transformed with pKR357 and pKR365
containing the highest levels of DHA are shown in Table 9.



WO 2004/071467    PCT/US2004/005758



TABLE 9

Fatty acid analysis of somatic embryos containing DHA pathway genes
(pKR357 and pKR365)

Event         16:0   18:0   18:1   18:2   GLA    18:3   18:4
1114-6-5-1    10.8   9.4    2.3    28.8   0      19.7   2
1114-6-5-7    13.8   8      6.4    30.1   2.1    15     2
1116-8-16-1   13.8   7      6.2    27.3   4      10.5   0.9

Event         20:2(11,14)   20:3(8,11,14)   ARA   20:3(11,14,17)
1114-6-5-1    6.2           3.2             1.4   4.2
1114-6-5-7    3.7           4.3             2.9   1.9
1116-8-16-1   4.6           3.9             5.2

Event         20:4(5,11,14,17)   EPA   DHA
1114-6-5-1    1.7                2.5   1.3
1114-6-5-7    1.6                4.1   1.6
1116-8-16-1   1.1                6.1   3.1

In addition to those fatty acids shown, 20:0, 20:1, 20:3(5,11,14), DPA and
ETA are also present in the extracts, each less than 1% of total fatty acids.

DHA is defined as 22:6(4,7,10,13,16,19) by the nomenclature described in
Example 11.
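
The shorthand used for DHA here, carbons:double bonds(positions counted from the carboxyl end), can be read mechanically. The sketch below parses such labels and derives the omega class as the carbon count minus the highest double-bond position. The function names are illustrative, and the final helper simply restates the "at least twenty carbon atoms and five or more carbon-carbon double bonds" wording of the claims as a predicate.

```python
import re
from typing import Optional, Tuple

_SHORTHAND = re.compile(r"^(\d+):(\d+)(?:\(([\d,]+)\))?$")


def parse_fatty_acid(label: str) -> Tuple[int, int, Tuple[int, ...]]:
    """Parse '22:6(4,7,10,13,16,19)' into (carbons, double bonds, positions)."""
    match = _SHORTHAND.match(label.replace(" ", ""))
    if match is None:
        raise ValueError(f"not a fatty acid shorthand: {label!r}")
    carbons, bonds = int(match.group(1)), int(match.group(2))
    positions = tuple(int(p) for p in match.group(3).split(",")) if match.group(3) else ()
    if positions and len(positions) != bonds:
        raise ValueError(f"{label!r}: {bonds} double bonds but {len(positions)} positions")
    return carbons, bonds, positions


def omega_class(label: str) -> Optional[int]:
    """Distance of the last double bond from the methyl end, or None if positions are absent."""
    carbons, _, positions = parse_fatty_acid(label)
    if not positions:
        return None
    return carbons - max(positions)


def has_20_carbons_and_5_double_bonds(label: str) -> bool:
    """Predicate restating the chain-length and unsaturation wording of the claims."""
    carbons, bonds, _ = parse_fatty_acid(label)
    return carbons >= 20 and bonds >= 5


for name in ("22:6(4,7,10,13,16,19)", "20:4(5,11,14,17)", "20:3(8,11,14)", "20:2(11,14)"):
    print(name, omega_class(name), has_20_carbons_and_5_double_bonds(name))
```

For DHA the last double bond is at carbon 19 of 22, which gives an omega class of 3, while 20:3(8,11,14) resolves to omega-6.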

Fatty acid methyl esters for embryos transformed with pKR357 and pKR364
containing the highest levels of DHA are shown in Table 10.

CLAIMS

What is claimed is:

1. An oilseed plant that produces mature seeds in which the total seed fatty
acid profile comprises at least 1.0 % of at least one polyunsaturated fatty acid
having at least twenty carbon atoms and five or more carbon-carbon double bonds.

2. An oilseed plant that produces mature seeds in which the total seed fatty
acid profile comprises at least 5.0 % of at least one polyunsaturated fatty acid
having at least twenty carbon atoms and five or more carbon-carbon double bonds.

3. An oilseed plant that produces mature seeds in which the total seed fatty
acid profile comprises at least 10.0 % of at least one polyunsaturated fatty acid
having at least twenty carbon atoms and five or more carbon-carbon double bonds.

4. An oilseed plant that produces mature seeds in which the total seed fatty
acid profile comprises at least 15.0 % of at least one polyunsaturated fatty acid
having at least twenty carbon atoms and five or more carbon-carbon double bonds.

5. An oilseed plant that produces mature seeds in which the total seed fatty
acid profile comprises at least 20.0 % of at least one polyunsaturated fatty acid
having at least twenty carbon atoms and five or more carbon-carbon double bonds.

6. An oilseed plant that produces mature seeds in which the total seed fatty
acid profile comprises at least 25.0 % of at least one polyunsaturated fatty acid
having at least twenty carbon atoms and five or more carbon-carbon double bonds.

7. An oilseed plant that produces mature seeds in which the total seed fatty
acid profile comprises at least 30.0 % of at least one polyunsaturated fatty acid
having at least twenty carbon atoms and five or more carbon-carbon double bonds.

8. An oilseed plant that produces mature seeds in which the total seed fatty
acid profile comprises at least 40.0 % of at least one polyunsaturated fatty acid
having at least twenty carbon atoms and five or more carbon-carbon double bonds.

9. An oilseed plant that produces mature seeds in which the total seed fatty
acid profile comprises at least 50.0 % of at least one polyunsaturated fatty acid
having at least twenty carbon atoms and five or more carbon-carbon double bonds.

10. An oilseed plant that produces mature seeds in which the total seed fatty
acid profile comprises at least 60.0 % of at least one polyunsaturated fatty acid
having at least twenty carbon atoms and five or more carbon-carbon double bonds.

11. The oilseed plant of any of Claims 1-10 wherein the polyunsaturated fatty
acid is an omega-3 fatty acid.

12. The oilseed plant of any of Claims 1-10 wherein the polyunsaturated fatty
acid is an omega-3 fatty acid selected from the group consisting of EPA, DPA, and
DHA.

13. The oilseed plant of any of Claims 3-10 wherein the total seed fatty acid
profile further comprises less than 2.0% arachidonic acid.

14. The oilseed plant of any of Claims 3-10 wherein the polyunsaturated fatty
acid is an omega-3 fatty acid and the total seed fatty acid profile further comprises
less than 2.0% arachidonic acid.

15. The oilseed plant of any of Claims 3-10 wherein the polyunsaturated fatty
acid is an omega-3 fatty acid selected from the group consisting of EPA, DPA, and
DHA, and the total seed fatty acid profile further comprises less than 2.0%
arachidonic acid.

16. Seeds obtained from the plant of any of Claims 1-10.

17. Seeds obtained from the plant of any of Claims 1-10 wherein the
polyunsaturated fatty acid is an omega-3 fatty acid.

18. Seeds obtained from the plant of any of Claims 1-10 wherein the
polyunsaturated fatty acid is an omega-3 fatty acid selected from the group
consisting of EPA, DPA, and DHA.

19. Seeds obtained from the plant of any of Claims 3-10 wherein the
polyunsaturated fatty acid is an omega-3 fatty acid and the total seed fatty acid
profile further comprises less than 2.0% arachidonic acid.

20. Seeds obtained from the plant of any of Claims 3-10 wherein the
polyunsaturated fatty acid is an omega-3 fatty acid selected from the group
consisting of EPA, DPA, and DHA, and the total seed fatty acid profile further
comprises less than 2.0% arachidonic acid.

21. Oil obtained from the seeds of the plants of any of Claims 1-10.

22. Oil obtained from the seeds of the plants of any of Claims 1-10 wherein
the polyunsaturated fatty acid is an omega-3 fatty acid.

23. Oil obtained from the seeds of the plants of any of Claims 1-10 wherein
the polyunsaturated fatty acid is an omega-3 fatty acid selected from the group
consisting of EPA, DPA, and DHA.

24. Oil obtained from the seeds of the plants of any of Claims 3-10 wherein
the polyunsaturated fatty acid is an omega-3 fatty acid and the total seed fatty acid
profile further comprises less than 2.0% arachidonic acid.

25. Oil obtained from the seeds of the plants of any of Claims 3-10 wherein
the polyunsaturated fatty acid is an omega-3 fatty acid selected from the group
consisting of EPA, DPA, and DHA, and the total seed fatty acid profile further
comprises less than 2.0% arachidonic acid.

26. The plant of any of Claims 1-10 wherein the oilseed plant is selected from
the group consisting of soybean, Brassica species, sunflower, maize, cotton, flax,
and safflower.

27. The plant of any of Claims 1-10 wherein the oilseed plant is selected from
the group consisting of soybean, Brassica species, sunflower, maize, cotton, flax,
and safflower, and further wherein the polyunsaturated fatty acid is an omega-3 fatty
acid.

28. The plant of any of Claims 1-10 wherein the oilseed plant is selected from
the group consisting of soybean, Brassica species, sunflower, maize, cotton, flax,
and safflower, and further wherein the polyunsaturated fatty acid is an omega-3 fatty
acid selected from the group consisting of EPA, DPA, and DHA.

29. The plant of any of Claims 3-10 wherein the oilseed plant is selected from
the group consisting of soybean, Brassica species, sunflower, maize, cotton, flax,
and safflower, and further wherein the polyunsaturated fatty acid is an omega-3 fatty
acid and the total seed fatty acid profile further comprises less than 2.0%
arachidonic acid.

30. The plant of any of Claims 3-10 wherein the oilseed plant is selected from
the group consisting of soybean, Brassica species, sunflower, maize, cotton, flax,
and safflower, and further wherein the polyunsaturated fatty acid is an omega-3 fatty
acid selected from the group consisting of EPA, DPA, and DHA, and the total seed
fatty acid profile further comprises less than 2.0% arachidonic acid.

31. A recombinant construct for altering the total fatty acid profile of mature
seeds of an oilseed plant, said construct comprising at least two promoters wherein
each promoter is operably linked to a nucleic acid sequence encoding a polypeptide
required for making at least one polyunsaturated fatty acid having at least twenty
carbon atoms and four or more carbon-carbon double bonds and further wherein the
total fatty acid profile comprises at least 2% of at least one polyunsaturated fatty
acid having at least twenty carbon atoms and four or more carbon-carbon double
bonds and further wherein said polypeptide is an enzyme selected from the group
consisting of a Δ4 desaturase, a Δ5 desaturase, a Δ6 desaturase, a Δ15
desaturase, a Δ17 desaturase, a C18 to C22 elongase and a C20 to C24 elongase.

32. The recombinant construct of Claim 31 wherein the promoter is selected
from the group consisting of the alpha prime subunit of beta conglycinin promoter,
Kunitz trypsin inhibitor 3 promoter, annexin promoter, Gly1 promoter, beta subunit of
beta conglycinin promoter, P34/Gly Bd m 30K promoter, albumin promoter, Leg A1
promoter and Leg A2 promoter.

33. An oilseed plant comprising in its genome the recombinant construct of
Claim 31.

34. An oilseed plant comprising in its genome the recombinant construct of
Claim 32.

35. The oilseed plant of Claim 33 wherein the oilseed plant is selected from
the group consisting of soybean, Brassica species, sunflower, maize, cotton, flax,
safflower.

36. The oilseed plant of Claim 34 wherein the oilseed plant is selected from
the group consisting of soybean, Brassica species, sunflower, maize, cotton, flax,
safflower.

37. Seeds obtained from the plant of Claim 33.

38. Seeds obtained from the plant of Claim 34.

39. Seeds obtained from the plant of Claim 35.

40. Seeds obtained from the plant of Claim 36.

41. Oil obtained from the seeds of Claim 37.

42. Oil obtained from the seeds of Claim 38.

43. Oil obtained from the seeds of Claim 49.

44. Oil obtained from the seeds of Claim 40.

45. A method for making an oilseed plant having an altered fatty acid profile
which comprises:

a) transforming a plant with the recombinant construct of Claim 31;

b) growing the transformed plant of step (a); and

c) selecting those plants wherein the total fatty acid profile comprises
at least 1.0 % of at least one polyunsaturated fatty acid having at
least twenty carbon atoms and five or more carbon-carbon double
bonds.

46. An oilseed plant made by the method of Claim 30.

47. Seeds obtained from the plant of Claim 31.

48. Oil obtained from the seeds of Claim 32.

49. A method for making an oilseed plant having an altered fatty acid profile
which comprises:

a) transforming a plant with the recombinant construct of Claim 32;

b) growing the transformed plant of step (a); and

c) selecting those plants wherein the total fatty acid profile comprises
at least 1.0 % of at least one polyunsaturated fatty acid having at
least twenty carbon atoms and five or more carbon-carbon double
bonds.

50. An oilseed plant made by the method of Claim 34.

51. Seeds obtained from the plant of Claim 35.

52. Oil obtained from the seeds of Claim 36.

53. A food product or food analog which has incorporated therein the oil of
Claim 21.

54. A food product or food analog which has incorporated therein the oil of
Claim 41.

55. A food product or food analog which has incorporated therein the oil of
Claim 42.

56. A food product or food analog which has incorporated therein the oil of
Claim 43.

57. A food product or food analog which has incorporated therein the oil of
Claim 44.

58. A food product or food analog which has incorporated therein the oil of
Claim 48.

59. A food product or food analog which has incorporated therein the oil of
Claim 52.

60. The food product of Claim 53 wherein said product is selected from the
group consisting of a spray-dried food particle, a freeze-dried food particle, meat
products, a cereal food, a snack food, a baked good, an extruded food, a fried food,
a health food, a dairy food, meat analogs, cheese analogs, milk analogs, a pet food,
animal feed or aquaculture feed.

61. The food product of Claim 54 wherein said product is selected from the
group consisting of a spray-dried food particle, a freeze-dried food particle, meat
products, a cereal food, a snack food, a baked good, an extruded food, a fried food,
a health food, a dairy food, meat analogs, cheese analogs, milk analogs, a pet food,
animal feed or aquaculture feed.

62. The food product of Claim 55 wherein said product is selected from the
group consisting of a spray-dried food particle, a freeze-dried food particle, meat
products, a cereal food, a snack food, a baked good, an extruded food, a fried food,
a health food, a dairy food, meat analogs, cheese analogs, milk analogs, a pet food,
animal feed or aquaculture feed.

63. The food product of Claim 56 wherein said product is selected from the
group consisting of a spray-dried food particle, a freeze-dried food particle, meat
products, a cereal food, a snack food, a baked good, an extruded food, a fried food,
a health food, a dairy food, meat analogs, cheese analogs, milk analogs, a pet food,
animal feed or aquaculture feed.

64. The food product of Claim 57 wherein said product is selected from the
group consisting of a spray-dried food particle, a freeze-dried food particle, meat
products, a cereal food, a snack food, a baked good, an extruded food, a fried food,
a health food, a dairy food, meat analogs, cheese analogs, milk analogs, a pet food,
animal feed or aquaculture feed.
65. The food product of Claim 58 wherein said product is selected from the
group consisting of a spray-dried food particle, a freeze-dried food particle, meat
products, a cereal food, a snack food, a baked good, an extruded food, a fried food,
a health food, a dairy food, meat analogs, cheese analogs, milk analogs, a pet food,
animal feed or aquaculture feed.

66. The food product of Claim 59 wherein said product is selected from the
group consisting of a spray-dried food particle, a freeze-dried food particle, meat
products, a cereal food, a snack food, a baked good, an extruded food, a fried food,
a health food, a dairy food, meat analogs, cheese analogs, milk analogs, a pet food,
animal feed or aquaculture feed.

67. A beverage which has incorporated therein the oil of Claim 21.

68. A beverage which has incorporated therein the oil of Claim 41.

69. A beverage which has incorporated therein the oil of Claim 42.

70. A beverage which has incorporated therein the oil of Claim 43.

71. A beverage which has incorporated therein the oil of Claim 44.

72. A beverage which has incorporated therein the oil of Claim 48.

73. A beverage which has incorporated therein the oil of Claim 52.

74. Infant formula which has incorporated therein the oil of Claim 21.

75. Infant formula which has incorporated therein the oil of Claim 41.

76. Infant formula which has incorporated therein the oil of Claim 42.

77. Infant formula which has incorporated therein the oil of Claim 43.

78. Infant formula which has incorporated therein the oil of Claim 44.

79. Infant formula which has incorporated therein the oil of Claim 48.

80. Infant formula which has incorporated therein the oil of Claim 52.

81. A nutritional supplement which has incorporated therein the oil of
Claim 21.

82. A nutritional supplement which has incorporated therein the oil of
Claim 41.

83. A nutritional supplement which has incorporated therein the oil of
Claim 42.

84. A nutritional supplement which has incorporated therein the oil of
Claim 43.

85. A nutritional supplement which has incorporated therein the oil of
Claim 44.

86. A nutritional supplement which has incorporated therein the oil of
Claim 48.

87. A nutritional supplement which has incorporated therein the oil of
Claim 52.

88. A food product or food analog which has incorporated therein the seed of
Claim [illegible].

89. A food product or food analog which has incorporated therein the seed of
Claim [illegible].

90. A food product or food analog which has incorporated therein the seed of
Claim [illegible].

91. A food product or food analog which has incorporated therein the seed of
Claim [illegible].

92. A food product or food analog which has incorporated therein the seed of
Claim [illegible].

93. A pet food which has incorporated therein the seed of Claim 16.

94. A pet food which has incorporated therein the seed of Claim 37.

95. A pet food which has incorporated therein the seed of Claim 38.

96. A pet food which has incorporated therein the seed of Claim 39.

97. A pet food which has incorporated therein the seed of Claim 40.

98. Animal feed which has incorporated therein the seed of Claim 16.

99. Animal feed which has incorporated therein the seed of Claim 37.

100. Animal feed which has incorporated therein the seed of Claim 38.

101. Animal feed which has incorporated therein the seed of Claim 39.

102. Animal feed which has incorporated therein the seed of Claim 40.

103. A pet food which has incorporated therein the oil of Claim 21.

104. A pet food which has incorporated therein the oil

105. A pet food which has incorporated therein the oil

106. A pet food which has incorporated therein the oil

107. A pet food which has incorporated therein the oil

108. A pet food which has incorporated therein the oil

109. A pet food which has incorporated therein the oil

110. Animal feed which has incorporated therein the oil of Claim 21.

111. Animal feed which has incorporated therein the oil of Claim 41.

112. Animal feed which has incorporated therein the oil of Claim 42.

113. Animal feed which has incorporated therein the oil of Claim 43.

114. Animal feed which has incorporated therein the oil of Claim 44.

115. Animal feed which has incorporated therein the oil of Claim 48.

116. Animal feed which has incorporated therein the oil of Claim 52.

117. A whole bean soy product made from the seed of Claim 16.

118. A whole bean soy product made from the seed of Claim 37.

119. A whole bean soy product made from the seed of Claim 38.

120. A whole bean soy product made from the seed of Claim 39.

121. A whole bean soy product made from the seed of Claim 40.

122. An aquaculture food product which has incorporated therein the oil of 
Claim 21. 

123. An aquaculture food product which has incorporated therein the oil of 
Claim 41.

124. An aquaculture food product which has incorporated therein the oil of 
Claim 42. 

125. An aquaculture food product which has incorporated therein the oil of 
Claim 43. 

126. An aquaculture food product which has incorporated therein the oil of
Claim 44.

127. An aquaculture food product which has incorporated therein the oil of 
Claim 48. 

128. An aquaculture food product which has incorporated therein the oil of 
Claim 52.

129. An aquaculture food product which has incorporated therein the seed of 
Claim 16. 

130. An aquaculture food product which has incorporated therein the seed of 
Claim 37. 

131. An aquaculture food product which has incorporated therein the seed of
Claim 38.

132. An aquaculture food product which has incorporated therein the seed of 
Claim 39. 

133. An aquaculture food product which has incorporated therein the seed of 
Claim 40.

134. An oilseed plant that produces mature seeds in which the total seed fatty 
acid profile comprises polyunsaturated fatty acids having at least twenty carbon 
atoms and five or more carbon-carbon double bonds wherein the ratio of EPA:DHA 
is in the range from 1:100 to 860:100.

135. An oilseed plant that produces mature seeds in which the total seed fatty
acid profile comprises polyunsaturated fatty acids having at least twenty carbon
atoms and five or more carbon-carbon double bonds wherein the ratio of EPA:DHA
is in the range from 1:100 to 860:100 and further wherein the total seed fatty acid
profile further comprises less than 2.0% arachidonic acid.

136. An oilseed plant that produces mature seeds in which the total seed fatty
acid profile comprises polyunsaturated fatty acids having at least twenty carbon
atoms and five or more carbon-carbon double bonds wherein the ratio of DHA:EPA
is in the range from 1:100 to 110:100.
137. An oilseed plant that produces mature seeds in which the total seed fatty
acid profile comprises polyunsaturated fatty acids having at least twenty carbon
atoms and five or more carbon-carbon double bonds wherein the ratio of DHA:EPA
is in the range from 1:100 to 110:100 and further wherein the total seed fatty acid
profile further comprises less than 2.0% arachidonic acid.

138. Seeds obtained from the plant of any of Claims 134-137.

139. Oil obtained from the seeds of the plants of any of Claims 134-137.
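
The ratio limits recited in Claims 134-137 can be illustrated with a short calculation. The following is a minimal sketch only, not part of the claimed subject matter: the function names, the decision to treat both endpoints of each range as inclusive, and the example fatty acid percentages are all assumptions introduced here for illustration.

```python
from fractions import Fraction

# Bounds taken from Claims 134-137, written there "per 100" (e.g. 860:100).
EPA_TO_DHA = (Fraction(1, 100), Fraction(860, 100))  # Claims 134 and 135
DHA_TO_EPA = (Fraction(1, 100), Fraction(110, 100))  # Claims 136 and 137
ARA_LIMIT_PCT = Fraction(2)                          # "less than 2.0%" (Claims 135, 137)


def ratio_in_range(numerator_pct, denominator_pct, bounds):
    """Return True if numerator:denominator lies within bounds.

    Assumption: both endpoints are treated as inclusive; the claims do not say.
    """
    num = Fraction(str(numerator_pct))
    den = Fraction(str(denominator_pct))
    if den <= 0:
        return False  # ratio is undefined when the denominator fatty acid is absent
    low, high = bounds
    return low <= num / den <= high


def low_arachidonic(ara_pct):
    """Return True if arachidonic acid is strictly below the 2.0% limit."""
    return Fraction(str(ara_pct)) < ARA_LIMIT_PCT


# Hypothetical profile (percent of total seed fatty acids), not data from this document.
profile = {"EPA": 10.0, "DHA": 2.5, "ARA": 1.0}
print(ratio_in_range(profile["EPA"], profile["DHA"], EPA_TO_DHA))
print(ratio_in_range(profile["DHA"], profile["EPA"], DHA_TO_EPA))
print(low_arachidonic(profile["ARA"]))
```

Exact fractions are used rather than floating point so that a profile lying exactly on a recited endpoint, such as 860:100, is classified consistently.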
FIGURE 1

SEQUENCE LISTING

<110> E.I. du Pont de Nemours and Company 

<120> Production of Long Chain Polyunsaturated Fatty Acids in Plants 

<130> BB1538 PCT 

<140> 
<141> 

<150> US 60/446,941 

<151> 2003-02-12 

<160> 98 

<170> PatentIn version 3.2



<210> 1
<211> 24
<212> DNA



<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 1 

gccccccatc ctttgaaagc ctgt 24

<210> 2

<211> 34

<212> DNA

<213> Artificial Sequence
<220>

<223> synthetic oligonucleotide

<400> 2

cgcggatccg agagcctcag catcttgagc agaa 34
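
Each record in this listing declares its length under <211> and repeats the running residue count at the end of each sequence line, so a record can be checked mechanically. The following is a minimal sketch, assuming residues are written in lowercase a, c, g, t in whitespace-separated blocks; the helper names are hypothetical and are not part of any sequence listing software.

```python
VALID_BASES = set("acgt")


def residues(sequence_line: str) -> str:
    """Drop whitespace (the listing groups residues in blocks of ten)."""
    return "".join(sequence_line.split())


def check_record(declared_length: int, sequence_line: str) -> bool:
    """Return True if the line has the declared length and only a/c/g/t residues."""
    seq = residues(sequence_line)
    return len(seq) == declared_length and set(seq) <= VALID_BASES


# <211> values and residues copied from SEQ ID NO:1 and SEQ ID NO:2 above.
records = {
    1: (24, "gccccccatc ctttgaaagc ctgt"),
    2: (34, "cgcggatccg agagcctcag catcttgagc agaa"),
}
for seq_id, (declared, line) in records.items():
    print(seq_id, check_record(declared, line))
```

A check of this kind also flags stray characters that are not nucleotide codes.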



<210> 3

<211> 2012 

<212> DNA 

<213> Glycine max 



<400> 3 

atcttaggcc cttgattata tggtgtttag atggattcac atgcaagttt ttatttcaat 60
cccttttcct ttgaataact gaccaagaac aacaagaaaa aaaaaaaaag aaaaggatca 120
ttttgaaagg atatttttcg ctcctattca aatactgtat ttttaccaaa aaaactgtat 180
ttttcctaca ctctcaagct ttgtttttcg cttcgactct catgatttcc ttcatatgcc 240
aatcactcta tttataaatg gcataaggta gtgtgaacaa ttgcaaagct tgtcatcaaa 300
agcttgcaat gtacaaatta atgtttttca tgcctttcaa aattatctgc accccctagc 360
tattaatcta acatctaagt aaggctagtg aattttttcg aatagtcatg cagtgcatta 420
atttccccgt gactattttg gctttgactc caacactggc cccgtacatc cgtccctcat 480
tacatgaaaa gaaatattgt ttatattctt aattaaaaat attgtccctt ctaaattttc 540
atatagttaa ttattatatt acttttttct ctattctatt agttctattt tcaaattatt 600
atttatgcat atgtaaagta cattatattt ttgctatata cttaaatatt tctaaattat 660
taaaaaaaga ctgatatgaa aaatttattc tttttaaagc tatatcattt tatatatact 720
ttttcttttc ttttctttca ttttctattc aatttaataa gaaataaatt ttgtaaattt 780
ttatttatca atttataaaa atattttact ttatatgttt tttcacattt ttgttaaaca 840
aatcatatca ttatgattga aagagaggaa attgacagtg agtaataagt gatgagaaaa 900
aaatgtgtta tttcctaaaa aaaacctaaa caaacatgta tctactctct atttcatcta 960
tctctcattt catttttctc tttatctctt tctttatttt tttatcatat catttcacat 1020
taattatttt tactctcttt attttttctc tctatccctc tcttatttcc actcatatat 1080
acactccaaa attggggcat gcctttatca ctactctatc tcctccacta aatcatttaa 1140
atgaaactga aaagcattgg caagtctcct cccctcctca agtgatttcc aactcagcat 1200
tggcatctga ttgattcagt atatctattg catgtgtaaa agtctttcca caatacataa 1260
ctattaatta atcttaaata aataaaggat aaaatatttt tttttcttca taaaattaaa 1320
atatgttatt ttttgtttag atgtatattc gaataaatct aaatatatga taatgatttt 1380
ttatattgat taaacatata atcaatatta aatatgatat ttttttatat aggttgtaca 1440
cataatttta taaggataaa aaatatgata aaaataaatt ttaaatattt ttatatttac 1500
gagaaaaaaa aatattttag gcataaataa atgaccagca tattttacaa ccttagtaat 1560
tcataaattc ctatatgtat atttgaaatt aaaaacagat aatcgttaag ggaaggaatc 1620
ctacgtcatc tcttgccatt tgtttttcat gcaaacagaa agggacgaaa aaccacctca 1680
ccatgaatca ctcttcacac catttttact agcaaacaag tctcaacaac tgaagccagc 1740
tctctttccg tttcttttta caacactttc tttgaaatag tagtattttt ttttcacatg 1800
atttattaac gtgccaaaag atgcttattg aatagagtgc acatttgtaa tgtactacta 1860
attagaacat gaaaaagcat tgttctaaca cgataatcct gtgaaggcgt taactccaaa 1920
gatccaattt cactatataa attgtgacga aagcaaaatg aattcacata gctgagagag 1980
aaaggaaagg ttaactaaga agcaatactt ca 2012



<210> 4 

<211> 27 

<212> DNA 

<213> Artificial Sequence 



<220>
<223> synthetic oligonucleotide

<400> 4

ggtccaatat ggaacgatga gttgata 27



<210> 5 

<211> 35 

<212> DNA 

<213> Artificial Sequence 



<220>
<223> synthetic oligonucleotide

<400> 5

cgcggatccg ctggaactag aagagagacc taaga 35



<210> 6 

<211> 1408 

<212> DNA 

<213> Glycine max 



<400> 6 

aactaaaaaa agctctcaaa ttacattttg agttgtttca ggttccattg ccttattgct 60
aaaactccaa ctaaaataac aaatagcaca tgcaggtgca aacaacacgt tactctgatg 120
aaggtgatgt gcctctagca gtctagctta tgaggctcgc tgcttatcaa cgattcatca 180
ttccccaaga cgtgtacgca gattaaacaa tggacaaaac ttcaatcgat tatagaataa 240
taattttaac agtgccgact tttttctgta aacaaaaggc cagaatcata tcgcacatca 300
tcttgaatgc agtgtcgagt ttggaccatt tgagtacaaa gccaatattg aatgattttt 360
cgattttaca tgtgtgaatc agacaaaagt gcatgcaatc acttgcaagt aaattaagga 420
tactaatcta ttcctttcat tttatatgct ccacttttat ataaaaaaat atacattatt 480
atatatgcat tattaattat tgcagtatta tgctattggt tttatggccc tgctaaataa 540
cctaaatgag tctaactatt gcatatgaat caaatgaagg aagaatcatg atctaaacct 600

WO 2004/071467 PCT/US2004/005758

gagtacccaa tgcaataaaa tgcgtcctat tacctaaact tcaaacacac attgccatcg 660
gacgtataaa ttaatgcata taggttattt tgagaaaaga aaacatcaaa agctctaaaa 720
cttcttttaa ctttgaaata agctgataaa aatacgcttt aaatcaactg tgtgctgtat 780
ataagctgca atttcacatt ttaccaaacc gaaacaagaa tggtaacagt gaggcaaaaa 840
tttgaaaaat gtcctacttc acattcacat caaattaatt acaactaaat aaataaacat 900
cgtgattcaa gcagtaatga aagtcgaaat cagatagaat atacacgttt aacatcaatt 960
gaattttttt ttaaatggat atatacaagt ttactatttt atatataatg aaaattcatt 1020
ttgtgttagc acaaaactta cagaaagaga taaattttaa ataaagagaa ttatatccaa 1080
ttttataatc caaaataatc aaattaaaga atattggcta gatagaccgg ctttttcact 1140
gcccctgctg gataatgaaa attcatatca aaacaataca gaagttctag tttaataata 1200
aaaaagttgg caaactgtca ttccctgttg gtttttaagc caaatcacaa ttcaattacg 1260
tatcagaaat taatttaaac caaatatata gctacgaggg aacttcttca gtcattacta 1320
gctagctcac taatcactat atatacgaca tgctacaagt gaagtgacca tatcttaatt 1380
tcaaatcata aaattcttcc accaagtt 1408

<210> 7
<211> 898
<212> DNA
<213> Glycine max

<400> 7

tatatatgtg agggtagagg gtatcacatg agctctggat ttccataatg aaaaggaatc 60
agaaaaaaga aaagggtttg caactaaaaa cttgggaaag aacaaaggtt taatcttggg 120
atcggtgacc aaacctcttt ttgataccat cttccattta atctagaata tgaaaataag 180
tggataataa aaaagaaaaa tgatatttaa tctaagttca aacaactcga ttagtccttt 240
cctcagttat aaaaaggaaa acaaaacaac gtacaactca atcagatttc aatttgctta 300
ttttgtttca actcaatatt tagcttttaa taattaacta aggtttttat attatattta 360
gaattttttt tctcctttta ttttatttgc atgtatatta ggagttgtcc aatgataatt 420
attctttaat aatgaatcat tagtcttaca tcattacatg atacacatgt atgagatgtc 480
cactccatct cttgttaatt tgatgggcat ccattactta tcaaccatcc gccatagtta 540
tctggttgtg tattttgtta tctgttggta ctctggagta gcatgcataa cgctatattt 600
ttatttctag gatcatgcat atacgcgcaa accaaagaac agagaccgat gtaaagacaa 660
aacatagagt atcctttcca aaacaacgtc caagttcata aaatagagac gaaatgcaag 720
cacagcacac ataagtggat gatcaagatg ggctcgtcca tgccacgcac accaacacac 780
gtcaagcagc aagccctccc gtggccaaat gtgcatgcat acatgttaac aagagcttgc 840
ataactataa atagccctaa tctcactcca tgtttcatcg tccaataata tatatact 898



<210> 8 

<211> 36 

<212> DNA 

<213> Artificial Sequence 



<220>
<223> synthetic oligonucleotide

<400> 8

cgcggatcct atatatgtga gggtagaggg tatcac 36

<210> 9
<211> 44
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 9

gaattcgcgg ccgcagtata tatattattg gacgatgaaa catg 44
<210> 10
<211> 690
<212> DNA
<213> Glycine max

<400> 10

tagcctaagt acgtactcaa aatgccaaca aataaaaaaa aagttgcttt aataatgcca 60
aaacaaatta ataaaacact tacaacaccg gatttttttt aattaaaatg tgccatttag 120
gataaatagt taatattttt aataattatt taaaaagccg tatctactaa aatgattttt 180
atttggttga aaatattaat atgtttaaat caacacaatc tatcaaaatt aaactaaaaa 240
aaaaataagt gtacgtggtt aacattagta cag-baatiata agaggaaaat gagaaattaa 300
gaaattgaaa gcgagtctaa tttttaaatt atgaacctgc atatataaaa ggaaagaaag 360
aatccaggaa gaaaagaaat gaaaccatgc atggtcccct cgtcatcacg agtttctgcc 420
atttgcaata gaaacactga aacacctttc tctttgtcac ttaattgaga tgccgaagcc 480
acctcacacc atgaacttca tgaggtgtag cacccaaggc ttccatagcc atgcatactg 540
aagaatgtct caagctcagc accctacttc tgtgacgttg tccctcattc accttcctct 600
cttccctata aataaccacg cctcaggttc tccgcttcac aactcaaaca ttctcctcca 660
ttggtcctta aacactcatc agtcatcacc 690

<210> 11
<211> 36
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 11

cgcggatcct agcctaagta cgtactcaaa atgcca 36

<210> 12
<211> 41
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 12

gaattcgcgg ccgcggtgat gactgatgag tgtttaagga c 41

<210> 13
<211> 32
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 13

ttgcggccgc aaaccatggc tgctgctccc ag 32

<210> 14
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 14

aagcggccgc ttactgcgcc ttac 24



<210> 15 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 15 

atctagacct gcaggccaac tgcgtttggg gctc 34 



<210> 16
<211> 40
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 16

cttttaactt cgcggccgct tgctattgat gggtgaagtg 40

<210> 17
<211> 38
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 17

caatagcaag cggccgcgaa gttaaaagca atgttgtc 38

<210> 18
<211> 35
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 18

aatctagacg tacgcaaagg caaagattta aactc 35

<210> 19
<211> 36
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 19

tttctagacg tacgtccctt cttatctttg atctcc 36

<210> 20
<211> 34
<212> DNA

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 20 

gcggccgcag ttggatagaa tatatgtttg tgac 34 

<210> 21
<211> 41
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 21

ctatccaact gcggccgcat ttcgcaccaa atcaatgaaa g 41

<210> 22
<211> 38
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 22

aatctagacg tacgtgaagg ttaaacatgg tgaatatg 38

<210> 23
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 23

atctagacgt acgtcctcga agagaaggg 29

<210> 24
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 24

ttctagacgt acggatataa tg 22

<210> 25
<211> 36
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 25

tttctagacg tacggtctca atagattaag aagttg 36



<210> 26 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 26 

gcggccgcga agagagatac taagagaatg ttg 33 

<210> 27
<211> 39
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 27

gtatctctct tcgcggccgc atttggcacc aaatcaatg 39

<210> 28
<211> 36
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 28

tttctagacg tacgtcaaaa aatttcattg taactc 36

<210> 29
<211> 37
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 29

cgcggatcca tcttaggccc ttgattatat ggtgttt 37

<210> 30
<211> 43
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 30

gaattcgcgg ccgctgaagt attgcttctt agttaacctt tcc 43
<210> 31
<211> 41
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 31

cgcggatcca actaaaaaaa gctctcaaat tacattttga g 41

<210> 32
<211> 44
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 32

gaattcgcgg ccgcaacttg gtggaagaat tttatgattt gaaa 44

<210> 33
<211> 1617
<212> DNA
<213> Mortierella alpina



<400> 33 

cgacactcct tccttcttct cacccgtcct agtccccttc aacccccctc tttgacaaag 60
acaacaaacc atggctgctg ctcccagtgt gaggacgttt actcgggccg aggttttgaa 120
tgccgaggct ctgaatgagg gcaagaagga tgccgaggca cccttcttga tgatcatcga 180
caacaaggtg tacgatgtcc gcgagttcgt ccctgatcat cccggtggaa gtgtgattct 240
cacgcacgtt ggcaaggacg gcactgacgt ctttgacact tttcaccccg aggctgcttg 300
ggagactctt gccaactttt acgttggtga tattgacgag agcgaccgcg atatcaagaa 360
tgatgacttt gcggccgagg tccgcaagct gcgtaccttg ttccagtctc ttggttacta 420
cgattcttcc aaggcatact acgccttcaa ggtctcgttc aacctctgca tctggggttt 480
gtcgacggtc attgtggcca agtggggcca gacctcgacc ctcgccaacg tgctctcggc 540
tgcgcttttg ggtctgttct ggcagcagtg cggatggttg gctcacgact ttttgcatca 600
ccaggtcttc caggaccgtt tctggggtga tcttttcggc gccttcttgg gaggtgtctg 660
ccagggcttc tcgtcctcgt ggtggaagga caagcacaac actcaccacg ccgcccccaa 720
cgtccacggc gaggatcccg acattgacac ccaccctctg ttgacctgga gtgagcatgc 780
gttggagatg ttctcggatg tcccagatga ggagctgacc cgcatgtggt cgcgtttcat 840
ggtcctgaac cagacctggt tttacttccc cattctctcg tttgcccgtc tctcctggtg 900
cctccagtcc attctctttg tgctgcctaa cggtcaggcc cacaagccct cgggcgcgcg 960
tgtgcccatc tcgttggtcg agcagctgtc gcttgcgatg cactggacct ggtacctcgc 1020
caccatgttc ctgttcatca aggatcccgt caacatgctg gtgtactttt tggtgtcgca 1080
ggcggtgtgc ggaaacttgt tggcgatcgt gttctcgctc aaccacaacg gtatgcctgt 1140
gatctcgaag gaggaggcgg tcgatatgga tttcttcacg aagcagatca tcacgggtcg 1200
tgatgtccac ccgggtctat ttgccaactg gttcacgggt ggattgaact atcagatcga 1260
gcaccacttg ttcccttcga tgcctcgcca caacttttca aagatccagc ctgctgtcga 1320
gaccctgtgc aaaaagtaca atgtccgata ccacaccacc ggtatgatcg agggaactgc 1380
agaggtcttt agccgtctga acgaggtctc caaggctgcc tccaagatgg gtaaggcgca 1440
gtaaaaaaaa aaacaaggac gttttttttc gccagtgcct gtgcctgtgc ctgcttccct 1500
tgtcaagtcg agcgtttctg gaaaggatcg ttcagtgcag tatcatcatt ctccttttac 1560
cccccgctca tatctcattc atttctctta ttaaacaact tgttcccccc ttcaccg 1617



<210> 34 

<211> 457 

<212> PRT 

<213> Mortierella alpina 
<400> 34

Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu
1 5 10 15

Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe
20 25 30

Leu Met Ile Ile Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro
35 40 45

Asp His Pro Gly Gly Ser Val Ile Leu Thr His Val Gly Lys Asp Gly
50 55 60

Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu
65 70 75 80

Ala Asn Phe Tyr Val Gly Asp Ile Asp Glu Ser Asp Arg Asp Ile Lys
85 90 95

Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gln
100 105 110

Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val
115 120 125

Ser Phe Asn Leu Cys Ile Trp Gly Leu Ser Thr Val Ile Val Ala Lys
130 135 140

Trp Gly Gln Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu
145 150 155 160

Gly Leu Phe Trp Gln Gln Cys Gly Trp Leu Ala His Asp Phe Leu His
165 170 175

His Gln Val Phe Gln Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe
180 185 190

Leu Gly Gly Val Cys Gln Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys
195 200 205

His Asn Thr His His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp
210 215 220

Ile Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met
225 230 235 240

Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe
245 250 255

Met Val Leu Asn Gln Thr Trp Phe Tyr Phe Pro Ile Leu Ser Phe Ala
260 265 270

Arg Leu Ser Trp Cys Leu Gln Ser Ile Leu Phe Val Leu Pro Asn Gly
275 280 285

Gln Ala His Lys Pro Ser Gly Ala Arg Val Pro Ile Ser Leu Val Glu
290 295 300

Gln Leu Ser Leu Ala Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe
305 310 315 320

Leu Phe Ile Lys Asp Pro Val Asn Met Leu Val Tyr Phe Leu Val Ser
325 330 335

Gln Ala Val Cys Gly Asn Leu Leu Ala Ile Val Phe Ser Leu Asn His
340 345 350

Asn Gly Met Pro Val Ile Ser Lys Glu Glu Ala Val Asp Met Asp Phe
355 360 365

Phe Thr Lys Gln Ile Ile Thr Gly Arg Asp Val His Pro Gly Leu Phe
370 375 380

Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu
385 390 395 400

Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys Ile Gln Pro Ala Val
405 410 415

Glu Thr Leu Cys Lys Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met
420 425 430

Ile Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys
435 440 445

Ala Ala Ser Lys Met Gly Lys Ala Gln
450 455

<210> 35 

<211> 1362 

<212> DNA 

<213> Saprolegnia diclina 



<400> 35 

atggtccagg ggcaaaaggc cgagaagatc tcgtgggcga ccatccgtga gcacaaccgc 60
caagacaacg cgtggatcgt gatccaccac aaggtgtacg acatctcggc ctttgaggac 120
cacccgggcg gcgtcgtcat gttcacgcag gccggcgaag acgcgaccga tgcgttcgct 180
gtcttccacc cgagctcggc gctcaagctc ctcgagcagt actacgtcgg cgacgtcgac 240
cagtcgacgg cggccgtcga cacgtcgatc tcggacgagg tcaagaagag ccagtcggac 300
ttcattgcgt cgtaccgcaa gctgcgcctt gaagtcaagc gcctcggctt gtacgactcg 360
agcaagctct actacctcta caagtgcgcc tcgacgctga gcattgcgct tgtgtcggcg 420
gccatttgcc tccactttga ctcgacggcc atgtacatgg tcgcggctgt catccttggc 480
ctcttttacc agcagtgcgg ctggctcgcc catgactttc tgcaccacca agtgtttgag 540
aaccacttgt ttggcgacct cgtcggcgtc atggtcggca acctctggca gggcttctcg 600
gtgcagtggt ggaagaacaa gcacaacacg caccatgcga tccccaacct ccacgcgacg 660
cccgagatcg ccttccacgg cgacccggac attgacacga tgccgattct cgcgtggtcg 720
ctcaagatgg cgcagcacgc ggtcgactcg cccgtcgggc tcttcttcat gcgctaccaa 780
gcgtacctgt actttcccat cttgctcttt gcgcgtatct cgtgggtgat ccagtcggcc 840
atgtacgcct tctacaacgt tgggcccggc ggcacctttg acaaggtcca gtacccgctg 900
ctcgagcgcg ccggcctcct cctctactac ggctggaacc tcggccttgt gtacgcagcc 960
aacatgtcgc tgctccaagc ggctgcgttc ctctttgtga gccaggcgtc gtgcggcctc 1020
ttcctcgcga tggtctttag cgtcggccac aacggcatgg aggtctttga caaggacagc 1080
aagcccgatt tttggaagct gcaagtgctc tcgacgcgca acgtgacgtc gtcgctctgg 1140
atcgactggt tcatgggcgg cctcaactac cagatcgacc accacttgtt cccgatggtg 1200
ccccggcaca acctcccggc gctcaacgtg ctcgtcaagt cgctctgcaa gcagtacgac 1260
atcccatacc acgagacggg cttcatcgcg ggcatggccg aggtcgtcgt gcacctcgag 1320
cgcatctcga tcgagttctt caaggagttt cccgccatgt aa 1362
<210> 36

<211> 453

<212> PRT

<213> Saprolegnia diclina

<400> 36

Met Val Gln Gly Gln Lys Ala Glu Lys Ile Ser Trp Ala Thr Ile Arg
1 5 10 15

Glu His Asn Arg Gln Asp Asn Ala Trp Ile Val Ile His His Lys Val
20 25 30

Tyr Asp Ile Ser Ala Phe Glu Asp His Pro Gly Gly Val Val Met Phe
35 40 45

Thr Gln Ala Gly Glu Asp Ala Thr Asp Ala Phe Ala Val Phe His Pro
50 55 60

Ser Ser Ala Leu Lys Leu Leu Glu Gln Tyr Tyr Val Gly Asp Val Asp
65 70 75 80

Gln Ser Thr Ala Ala Val Asp Thr Ser Ile Ser Asp Glu Val Lys Lys
85 90 95

Ser Gln Ser Asp Phe Ile Ala Ser Tyr Arg Lys Leu Arg Leu Glu Val
100 105 110

Lys Arg Leu Gly Leu Tyr Asp Ser Ser Lys Leu Tyr Tyr Leu Tyr Lys
115 120 125

Cys Ala Ser Thr Leu Ser Ile Ala Leu Val Ser Ala Ala Ile Cys Leu
130 135 140

His Phe Asp Ser Thr Ala Met Tyr Met Val Ala Ala Val Ile Leu Gly
145 150 155 160

Leu Phe Tyr Gln Gln Cys Gly Trp Leu Ala His Asp Phe Leu His His
165 170 175

Gln Val Phe Glu Asn His Leu Phe Gly Asp Leu Val Gly Val Met Val
180 185 190

Gly Asn Leu Trp Gln Gly Phe Ser Val Gln Trp Trp Lys Asn Lys His
195 200 205

Asn Thr His His Ala Ile Pro Asn Leu His Ala Thr Pro Glu Ile Ala
210 215 220

Phe His Gly Asp Pro Asp Ile Asp Thr Met Pro Ile Leu Ala Trp Ser
225 230 235 240

Leu Lys Met Ala Gln His Ala Val Asp Ser Pro Val Gly Leu Phe Phe
245 250 255

Met Arg Tyr Gln Ala Tyr Leu Tyr Phe Pro Ile Leu Leu Phe Ala Arg
260 265 270

Ile Ser Trp Val Ile Gln Ser Ala Met Tyr Ala Phe Tyr Asn Val Gly
275 280 285

Pro Gly Gly Thr Phe Asp Lys Val Gln Tyr Pro Leu Leu Glu Arg Ala
290 295 300

Gly Leu Leu Leu Tyr Tyr Gly Trp Asn Leu Gly Leu Val Tyr Ala Ala
305 310 315 320

Asn Met Ser Leu Leu Gln Ala Ala Ala Phe Leu Phe Val Ser Gln Ala
325 330 335

Ser Cys Gly Leu Phe Leu Ala Met Val Phe Ser Val Gly His Asn Gly
340 345 350

Met Glu Val Phe Asp Lys Asp Ser Lys Pro Asp Phe Trp Lys Leu Gln
355 360 365

Val Leu Ser Thr Arg Asn Val Thr Ser Ser Leu Trp Ile Asp Trp Phe
370 375 380

Met Gly Gly Leu Asn Tyr Gln Ile Asp His His Leu Phe Pro Met Val
385 390 395 400

Pro Arg His Asn Leu Pro Ala Leu Asn Val Leu Val Lys Ser Leu Cys
405 410 415

Lys Gln Tyr Asp Ile Pro Tyr His Glu Thr Gly Phe Ile Ala Gly Met
420 425 430

Ala Glu Val Val Val His Leu Glu Arg Ile Ser Ile Glu Phe Phe Lys
435 440 445

Glu Phe Pro Ala Met
450

<210> 37 

<211> 1413 

<212> DNA 

<213> Saprolegnia diclina 



<400> 37 

atggccccgc agacggagct ccgccagcgc cacgccgccg tcgccgagac gccggtggcc 60
ggcaagaagg cctttacatg gcaggaggtc gcgcagcaca acacggcggc ctcggcctgg 120
atcattatcc gcggcaaggt ctacgacgtg accgagtggg ccaacaagca ccccggcggc 180
cgcgagatgg tgctgctgca cgccggtcgc gaggccaccg acacgttcga ctcgtaccac 240
ccgttcagcg acaaggccga gtcgatcttg aacaagtatg agattggcac gttcacgggc 300
ccgtccgagt ttccgacctt caagccggac acgggcttct acaaggagtg ccgcaagcgc 360
gttggcgagt acttcaagaa gaacaacctc catccgcagg acggcttccc gggcctctgg 420
cgcatgatgg tcgtgtttgc ggtcgccggc ctcgccttgt acggcatgca cttttcgact 480
atctttgcgc tgcagctcgc ggccgcggcg ctctttggcg tctgccaggc gctgccgctg 540
ctccacgtca tgcacgactc gtcgcacgcg tcgtacacca acatgccgtt cttccattac 600
gtcgtcggcc gctttgccat ggactggttt gccggcggct cgatggtgtc atggctcaac 660
cagcacgtcg tgggccacca catctacacg aacgtcgcgg gctcggaccc ggatcttccg 720
gtcaacatgg acggcgacat ccgccgcatc gtgaaccgcc aggtgttcca gcccatgtac 780
gcattccagc acatctacct tccgccgctc tatggcgtgc ttggcctcaa gttccgcatc 840
caggacttca ccgacacgtt cggctcgcac acgaacggcc cgatccgcgt caacccgcac 900
gcgctctcga cgtggatggc catgatcagc tccaagtcgt tctgggcctt ctaccgcgtg 960
taccttccgc ttgccgtgct ccagatgccc atcaagacgt accttgcgat cttcttcctc 1020
gccgagtttg tcacgggctg gtacctcgcg ttcaacttcc aagtaagcca tgtctcgacc 1080
gagtgcggct acccatgcgg cgacgaggcc aagatggcgc tccaggacga gtgggcagtc 1140
tcgcaggtca agacgtcggt cgactacgcc catggctcgt ggatgacgac gttccttgcc 1200
ggcgcgctca actaccaggt cgtgcaccac ttgttcccca gcgtgtcgca gtaccactac 1260
ccggcgatcg cgcccatcat cgtcgacgtc tgcaaggagt acaacatcaa gtacgccatc 1320
ttgccggact ttacggcggc gttcgttgcc cacttgaagc acctccgcaa catgggccag 1380
cagggcatcg ccgccacgat ccacatgggc taa 1413

<210> 38 

<211> 470 

<212> PRT 

<213> Saprolegnia diclina 

<400> 38 

Met Ala Pro Gln Thr Glu Leu Arg Gln Arg His Ala Ala Val Ala Glu
1 5 10 15

Thr Pro Val Ala Gly Lys Lys Ala Phe Thr Trp Gln Glu Val Ala Gln
20 25 30

His Asn Thr Ala Ala Ser Ala Trp Ile Ile Ile Arg Gly Lys Val Tyr
35 40 45

Asp Val Thr Glu Trp Ala Asn Lys His Pro Gly Gly Arg Glu Met Val
50 55 60

Leu Leu His Ala Gly Arg Glu Ala Thr Asp Thr Phe Asp Ser Tyr His
65 70 75 80

Pro Phe Ser Asp Lys Ala Glu Ser Ile Leu Asn Lys Tyr Glu Ile Gly
85 90 95

Thr Phe Thr Gly Pro Ser Glu Phe Pro Thr Phe Lys Pro Asp Thr Gly
100 105 110

Phe Tyr Lys Glu Cys Arg Lys Arg Val Gly Glu Tyr Phe Lys Lys Asn
115 120 125

Asn Leu His Pro Gln Asp Gly Phe Pro Gly Leu Trp Arg Met Met Val
130 135 140

Val Phe Ala Val Ala Gly Leu Ala Leu Tyr Gly Met His Phe Ser Thr
145 150 155 160

Ile Phe Ala Leu Gln Leu Ala Ala Ala Ala Leu Phe Gly Val Cys Gln
165 170 175

Ala Leu Pro Leu Leu His Val Met His Asp Ser Ser His Ala Ser Tyr
180 185 190

Thr Asn Met Pro Phe Phe His Tyr Val Val Gly Arg Phe Ala Met Asp
195 200 205

Trp Phe Ala Gly Gly Ser Met Val Ser Trp Leu Asn Gln His Val Val
210 215 220

Gly His His Ile Tyr Thr Asn Val Ala Gly Ser Asp Pro Asp Leu Pro
225 230 235 240

Val Asn Met Asp Gly Asp Ile Arg Arg Ile Val Asn Arg Gln Val Phe
245 250 255

Gln Pro Met Tyr Ala Phe Gln His Ile Tyr Leu Pro Pro Leu Tyr Gly
260 265 270

Val Leu Gly Leu Lys Phe Arg Ile Gln Asp Phe Thr Asp Thr Phe Gly
275 280 285

Ser His Thr Asn Gly Pro Ile Arg Val Asn Pro His Ala Leu Ser Thr
290 295 300

Trp Met Ala Met Ile Ser Ser Lys Ser Phe Trp Ala Phe Tyr Arg Val
305 310 315 320

Tyr Leu Pro Leu Ala Val Leu Gln Met Pro Ile Lys Thr Tyr Leu Ala
325 330 335

Ile Phe Phe Leu Ala Glu Phe Val Thr Gly Trp Tyr Leu Ala Phe Asn
340 345 350

Phe Gln Val Ser His Val Ser Thr Glu Cys Gly Tyr Pro Cys Gly Asp
355 360 365

Glu Ala Lys Met Ala Leu Gln Asp Glu Trp Ala Val Ser Gln Val Lys
370 375 380

Thr Ser Val Asp Tyr Ala His Gly Ser Trp Met Thr Thr Phe Leu Ala
385 390 395 400

Gly Ala Leu Asn Tyr Gln Val Val His His Leu Phe Pro Ser Val Ser
405 410 415

Gln Tyr His Tyr Pro Ala Ile Ala Pro Ile Ile Val Asp Val Cys Lys
420 425 430

Glu Tyr Asn Ile Lys Tyr Ala Ile Leu Pro Asp Phe Thr Ala Ala Phe
435 440 445

Val Ala His Leu Lys His Leu Arg Asn Met Gly Gln Gln Gly Ile Ala
450 455 460

Ala Thr Ile His Met Gly
465 470

<210> 39 
<211> 819 
<212> DNA 

<213> Thraustochytrium aureum 
<400> 39 

atggcaaaca gcagcgtgtg ggatgatgtg gtgggccgcg tggagaccgg cgtggaccag 60 

tggatggatg gcgccaagcc gtacgcactc accgatgggc tcccgatgat ggacgtgtcc 120 

accatgctgg cattcgaggt gggatacatg gccatgctgc tcttcggcat cccgatcatg 180 

aggcagatgg agaagccttt tgagctcaag accatcaagc tcttgcacaa cttgtttctc 240 

ttcggacttt ccttgtacat gtgcgtggtg accatccgcc aggctatcct tggaggctac 300 

aaagtgtttg gaaacgacat ggagaagggc aacgagtctc atgctcaggg catgtctcgc 360 

atcgtgtacg tgttctacgt gtccaaggca tacgagttct tggataccgc catcatgatc 420 

ctttgcaaga agttcaacca ggtttccttc ttgcatgtgt accaccatgc caccattttt 480

gccatctggt gggctatcgc caagtacgct ccaggaggtg atgcgtactt ttcagtgatc 540 

ctcaactctt tcgtgcacac cgtcatgtac gcatactact tcttctcctc ccaagggttc 600 

gggttcgtga agccaatcaa gccgtacatc accacccttc agatgaccca gttcatggca 660 

atgcttgtgc agtccttgta cgactacctc ttcccatgcg actacccaca ggctcttgtg 720 

cagcttcttg gagtgtacat gatcaccttg cttgccctct tcggcaactt ttttgtgcag 780 

agctatctta aaaagccaaa aaagagcaag accaactaa 819 
<210> 40
<211> 272
<212> PRT

<213> Thraustochytrium aureum
<400> 40

Met Ala Asn Ser Ser Val Trp Asp Asp Val Val Gly Arg Val Glu Thr
1 5 10 15

Gly Val Asp Gln Trp Met Asp Gly Ala Lys Pro Tyr Ala Leu Thr Asp
20 25 30

Gly Leu Pro Met Met Asp Val Ser Thr Met Leu Ala Phe Glu Val Gly
35 40 45

Tyr Met Ala Met Leu Leu Phe Gly Ile Pro Ile Met Arg Gln Met Glu
50 55 60

Lys Pro Phe Glu Leu Lys Thr Ile Lys Leu Leu His Asn Leu Phe Leu
65 70 75 80

Phe Gly Leu Ser Leu Tyr Met Cys Val Val Thr Ile Arg Gln Ala Ile
85 90 95

Leu Gly Gly Tyr Lys Val Phe Gly Asn Asp Met Glu Lys Gly Asn Glu
100 105 110

Ser His Ala Gln Gly Met Ser Arg Ile Val Tyr Val Phe Tyr Val Ser
115 120 125

Lys Ala Tyr Glu Phe Leu Asp Thr Ala Ile Met Ile Leu Cys Lys Lys
130 135 140

Phe Asn Gln Val Ser Phe Leu His Val Tyr His His Ala Thr Ile Phe
145 150 155 160

Ala Ile Trp Trp Ala Ile Ala Lys Tyr Ala Pro Gly Gly Asp Ala Tyr
165 170 175

Phe Ser Val Ile Leu Asn Ser Phe Val His Thr Val Met Tyr Ala Tyr
180 185 190

Tyr Phe Phe Ser Ser Gln Gly Phe Gly Phe Val Lys Pro Ile Lys Pro
195 200 205

Tyr Ile Thr Thr Leu Gln Met Thr Gln Phe Met Ala Met Leu Val Gln
210 215 220

Ser Leu Tyr Asp Tyr Leu Phe Pro Cys Asp Tyr Pro Gln Ala Leu Val
225 230 235 240

Gln Leu Leu Gly Val Tyr Met Ile Thr Leu Leu Ala Leu Phe Gly Asn
245 250 255

Phe Phe Val Gln Ser Tyr Leu Lys Lys Pro Lys Lys Ser Lys Thr Asn
260 265 270

<210> 41 
<211> 1077 






WO 2004/071467 PCT/US2004/005758



<212> DNA 

<213> Saprolegnia diclina 



<400> 41 

atgactgagg ataagacgaa ggtcgagttc ccgacgctca cggagctcaa gcactcgatc 60

ccgaacgcgt gctttgagtc gaacctcggc ctctcgctct actacacggc ccgcgcgatc 120

ttcaacgcgt cggcctcggc ggcgctgctc tacgcggcgc gctcgacgcc gttcattgcc 180

gataacgttc tgctccacgc gctcgtttgc gccacctaca tctacgtgca gggcgtcatc 240

ttctggggct tcttcacggt cggccacgac tgcggccact cggccttctc gcgctaccac 300

agcgtcaact ttatcatcgg ctgcatcatg cactctgcga ttttgacgcc gttcgagagc 360

tggcgcgtga cgcaccgcca ccaccacaag aacacgggca acattgataa ggacgagatc 420

ttttacccgc accggtcggt caaggacctc caggacgtgc gccaatgggt ctacacgctc 480

ggcggtgcgt ggtttgtcta cttgaaggtc gggtatgccc cgcgcacgat gagccacttt 540

gacccgtggg acccgctcct ccttcgccgc gcgtcggccg tcatcgtgtc gctcggcgtc 600

tgggccgcct tcttcgccgc gtacgcgtac ctcacatact cgctcggctt tgccgtcatg 660

ggcctctact actatgcgcc gctctttgtc tttgcttcgt tcctcgtcat tacgaccttc 720

ttgcaccaca acgacgaagc gacgccgtgg tacggcgact cggagtggac gtacgtcaag 780

ggcaacctct cgagcgtcga ccgctcgtac ggcgcgttcg tggacaacct gagccaccac 840

attggcacgc accaggtcca ccacttgttc ccgatcattc cgcactacaa gctcaacgaa 900

gccaccaagc actttgcggc cgcgtacccg cacctcgtgc gcaggaacga cgagcccatc 960

atcacggcct tcttcaagac cgcgcacctc tttgtcaact acggcgctgt gcccgagacg 1020

gcgcagatct tcacgctcaa agagtcggcc gcggccgcca aggccaagtc ggactaa 1077

<210> 42
<211> 358
<212> PRT

<213> Saprolegnia diclina
<400> 42

Met Thr Glu Asp Lys Thr Lys Val Glu Phe Pro Thr Leu Thr Glu Leu
1 5 10 15

Lys His Ser Ile Pro Asn Ala Cys Phe Glu Ser Asn Leu Gly Leu Ser
20 25 30

Leu Tyr Tyr Thr Ala Arg Ala Ile Phe Asn Ala Ser Ala Ser Ala Ala
35 40 45

Leu Leu Tyr Ala Ala Arg Ser Thr Pro Phe Ile Ala Asp Asn Val Leu
50 55 60

Leu His Ala Leu Val Cys Ala Thr Tyr Ile Tyr Val Gln Gly Val Ile
65 70 75 80

Phe Trp Gly Phe Phe Thr Val Gly His Asp Cys Gly His Ser Ala Phe
85 90 95

Ser Arg Tyr His Ser Val Asn Phe Ile Ile Gly Cys Ile Met His Ser
100 105 110

Ala Ile Leu Thr Pro Phe Glu Ser Trp Arg Val Thr His Arg His His
115 120 125

His Lys Asn Thr Gly Asn Ile Asp Lys Asp Glu Ile Phe Tyr Pro His
130 135 140

Arg Ser Val Lys Asp Leu Gln Asp Val Arg Gln Trp Val Tyr Thr Leu
145 150 155 160

Gly Gly Ala Trp Phe Val Tyr Leu Lys Val Gly Tyr Ala Pro Arg Thr
165 170 175

Met Ser His Phe Asp Pro Trp Asp Pro Leu Leu Leu Arg Arg Ala Ser
180 185 190

Ala Val Ile Val Ser Leu Gly Val Trp Ala Ala Phe Phe Ala Ala Tyr
195 200 205

Ala Tyr Leu Thr Tyr Ser Leu Gly Phe Ala Val Met Gly Leu Tyr Tyr
210 215 220

Tyr Ala Pro Leu Phe Val Phe Ala Ser Phe Leu Val Ile Thr Thr Phe
225 230 235 240

Leu His His Asn Asp Glu Ala Thr Pro Trp Tyr Gly Asp Ser Glu Trp
245 250 255

Thr Tyr Val Lys Gly Asn Leu Ser Ser Val Asp Arg Ser Tyr Gly Ala
260 265 270

Phe Val Asp Asn Leu Ser His His Ile Gly Thr His Gln Val His His
275 280 285

Leu Phe Pro Ile Ile Pro His Tyr Lys Leu Asn Glu Ala Thr Lys His
290 295 300

Phe Ala Ala Ala Tyr Pro His Leu Val Arg Arg Asn Asp Glu Pro Ile
305 310 315 320

Ile Thr Ala Phe Phe Lys Thr Ala His Leu Phe Val Asn Tyr Gly Ala
325 330 335

Val Pro Glu Thr Ala Gln Ile Phe Thr Leu Lys Glu Ser Ala Ala Ala
340 345 350

Ala Lys Ala Lys Ser Asp
355

<210> 43 
<211> 957 
<212> DNA 

<213> Mortierella alpina 
<400> 43 

atggagtcga ttgcgccatt cctcccatca aagatgccgc aagatctgtt tatggacctt 60 

gccaccgcta tcggtgtccg ggccgcgccc tatgtcgatc ctctcgaggc cgcgctggtg 120 

gcccaggccg agaagtacat ccccacgatt gtccatcaca cgcgtgggtt cctggtcgcg 180 

gtggagtcgc ctttggcccg tgagctgccg ttgatgaacc cgttccacgt gctgttgatc 240 

gtgctcgctt atttggtcac ggtctttgtg ggcatgcaga tcatgaagaa ctttgagcgg 300 

ttcgaggtca agacgttttc gctcctgcac aacttttgtc tggtctcgat cagcgcctac 360 

atgtgcggtg ggatcctgta cgaggcttat caggccaact atggactgtt tgagaacgct 420 

gctgatcata ccttcaaggg tcttcctatg gccaagatga tctggctctt ctacttctcc 480 

aagatcatgg agtttgtcga caccatgatc atggtcctca agaagaacaa ccgccagatc 540 

tccttcttgc acgtttacca ccacagctcc atcttcacca tctggtggtt ggtcaccttt 600 

gttgcaccca acggtgaagc ctacttctct gctgcgttga actcgttcat ccatgtgatc 660 

atgtacggct actacttctt gtcggccttg ggcttcaagc aggtgtcgtt catcaagttc 720 

tacatcacgc gctcgcagat gacacagttc tgcatgatgt cggtccagtc ttcctgggac 780 

atgtacgcca tgaaggtcct tggccgcccc ggatacccct tcttcatcac ggctctgctt 840
tggttctaca tgtggaccat gctcggtctc ttctacaact tttacagaaa gaacgccaag 900
ttggccaagc aggccaaggc cgacgctgcc aaggagaagg caaggaagtt gcagtaa 957 

<210> 44 
<211> 318 
<212> PRT 

<213> Mortierella alpina 
<400> 44 

Met Glu Ser Ile Ala Pro Phe Leu Pro Ser Lys Met Pro Gln Asp Leu
1 5 10 15

Phe Met Asp Leu Ala Thr Ala Ile Gly Val Arg Ala Ala Pro Tyr Val
20 25 30

Asp Pro Leu Glu Ala Ala Leu Val Ala Gln Ala Glu Lys Tyr Ile Pro
35 40 45

Thr Ile Val His His Thr Arg Gly Phe Leu Val Ala Val Glu Ser Pro
50 55 60

Leu Ala Arg Glu Leu Pro Leu Met Asn Pro Phe His Val Leu Leu Ile
65 70 75 80

Val Leu Ala Tyr Leu Val Thr Val Phe Val Gly Met Gln Ile Met Lys
85 90 95

Asn Phe Glu Arg Phe Glu Val Lys Thr Phe Ser Leu Leu His Asn Phe
100 105 110

Cys Leu Val Ser Ile Ser Ala Tyr Met Cys Gly Gly Ile Leu Tyr Glu
115 120 125

Ala Tyr Gln Ala Asn Tyr Gly Leu Phe Glu Asn Ala Ala Asp His Thr
130 135 140

Phe Lys Gly Leu Pro Met Ala Lys Met Ile Trp Leu Phe Tyr Phe Ser
145 150 155 160

Lys Ile Met Glu Phe Val Asp Thr Met Ile Met Val Leu Lys Lys Asn
165 170 175

Asn Arg Gln Ile Ser Phe Leu His Val Tyr His His Ser Ser Ile Phe
180 185 190

Thr Ile Trp Trp Leu Val Thr Phe Val Ala Pro Asn Gly Glu Ala Tyr
195 200 205

Phe Ser Ala Ala Leu Asn Ser Phe Ile His Val Ile Met Tyr Gly Tyr
210 215 220

Tyr Phe Leu Ser Ala Leu Gly Phe Lys Gln Val Ser Phe Ile Lys Phe
225 230 235 240

Tyr Ile Thr Arg Ser Gln Met Thr Gln Phe Cys Met Met Ser Val Gln
245 250 255

Ser Ser Trp Asp Met Tyr Ala Met Lys Val Leu Gly Arg Pro Gly Tyr
260 265 270

Pro Phe Phe Ile Thr Ala Leu Leu Trp Phe Tyr Met Trp Thr Met Leu
275 280 285

Gly Leu Phe Tyr Asn Phe Tyr Arg Lys Asn Ala Lys Leu Ala Lys Gln
290 295 300

Ala Lys Ala Asp Ala Ala Lys Glu Lys Ala Arg Lys Leu Gln
305 310 315

<210> 45 

<211> 1483 

<212> DNA 

<213> Mortierella alpina 



<400> 45 

gcttcctcca gttcatcctc catttcgcca cctgcattct ttacgaccgt taagcaagat 60

gggaacggac caaggaaaaa ccttcacctg ggaagagctg gcggcccata acaccaagga 120

cgacctactc ttggccatcc gcggcagggt gtacgatgtc acaaagttct tgagccgcca 180

tcctggtgga gtggacactc tcctgctcgg agctggccga gatgttactc cggtctttga 240

gatgtatcac gcgtttgggg ctgcagatgc cattatgaag aagtactatg tcggtacact 300

ggtctcgaat gagctgccca tcttcccgga gccaacggtg ttccacaaaa ccatcaagac 360

gagagtcgag ggctacttta cggatcggaa cattgatccc aagaatagac cagagatctg 420

gggacgatac gctcttatct ttggatcctt gatcgcttcc tactacgcgc agctctttgt 480

gcctttcgtt gtcgaacgca catggcttca ggtggtgttt gcaatcatca tgggatttgc 540

gtgcgcacaa gtcggactca accctcttca tgatgcgtct cacttttcag tgacccacaa 600

ccccactgtc tggaagattc tgggagccac gcacgacttt ttcaacggag catcgtacct 660

ggtgtggatg taccaacata tgctcggcca tcacccctac accaacattg ctggagcaga 720

tcccgacgtg tcgacgtctg agcccgatgt tcgtcgtatc aagcccaacc aaaagtggtt 780

tgtcaaccac atcaaccagc acatgtttgt tcctttcctg tacggactgc tggcgttcaa 840

ggtgcgcatt caggacatca acattttgta ctttgtcaag accaatgacg ctattcgtgt 900

caatcccatc tcgacatggc acactgtgat gttctggggc ggcaaggctt tctttgtctg 960

gtatcgcctg attgttcccc tgcagtatct gcccctgggc aaggtgctgc tcttgttcac 1020

ggtcgcggac atggtgtcgt cttactggct ggcgctgacc ttccaggcga accacgttgt 1080

tgaggaagtt cagtggccgt tgcctgacga gaacgggatc atccaaaagg actgggcagc 1140

tatgcaggtc gagactacgc aggattacgc acacgattcg cacctctgga ccagcatcac 1200

tggcagcttg aactaccagg ctgtgcacca tctgttcccc aacgtgtcgc agcaccatta 1260

tcccgatatt ctggccatca tcaagaacac ctgcagcgag tacaaggttc cataccttgt 1320

caaggatacg ttttggcaag catttgcttc acatttggag cacttgcgtg ttcttggact 1380

ccgtcccaag gaagagtaga agaaaaaaag cgccgaatga agtattgccc cctttttctc 1440

caagaatggc aaaaggagat caagtggaca ttctctatga aga 1483



<210> 46 

<211> 446 

<212> PRT 

<213> Mortierella alpina 



<400> 46 

Met Gly Thr Asp Gln Gly Lys Thr Phe Thr Trp Glu Glu Leu Ala Ala
1 5 10 15

His Asn Thr Lys Asp Asp Leu Leu Leu Ala Ile Arg Gly Arg Val Tyr
20 25 30

Asp Val Thr Lys Phe Leu Ser Arg His Pro Gly Gly Val Asp Thr Leu
35 40 45

Leu Leu Gly Ala Gly Arg Asp Val Thr Pro Val Phe Glu Met Tyr His
50 55 60

Ala Phe Gly Ala Ala Asp Ala Ile Met Lys Lys Tyr Tyr Val Gly Thr
65 70 75 80

Leu Val Ser Asn Glu Leu Pro Ile Phe Pro Glu Pro Thr Val Phe His
85 90 95

Lys Thr Ile Lys Thr Arg Val Glu Gly Tyr Phe Thr Asp Arg Asn Ile
100 105 110

Asp Pro Lys Asn Arg Pro Glu Ile Trp Gly Arg Tyr Ala Leu Ile Phe
115 120 125

Gly Ser Leu Ile Ala Ser Tyr Tyr Ala Gln Leu Phe Val Pro Phe Val
130 135 140

Val Glu Arg Thr Trp Leu Gln Val Val Phe Ala Ile Ile Met Gly Phe
145 150 155 160

Ala Cys Ala Gln Val Gly Leu Asn Pro Leu His Asp Ala Ser His Phe
165 170 175

Ser Val Thr His Asn Pro Thr Val Trp Lys Ile Leu Gly Ala Thr His
180 185 190

Asp Phe Phe Asn Gly Ala Ser Tyr Leu Val Trp Met Tyr Gln His Met
195 200 205

Leu Gly His His Pro Tyr Thr Asn Ile Ala Gly Ala Asp Pro Asp Val
210 215 220

Ser Thr Ser Glu Pro Asp Val Arg Arg Ile Lys Pro Asn Gln Lys Trp
225 230 235 240

Phe Val Asn His Ile Asn Gln His Met Phe Val Pro Phe Leu Tyr Gly
245 250 255

Leu Leu Ala Phe Lys Val Arg Ile Gln Asp Ile Asn Ile Leu Tyr Phe
260 265 270

Val Lys Thr Asn Asp Ala Ile Arg Val Asn Pro Ile Ser Thr Trp His
275 280 285

Thr Val Met Phe Trp Gly Gly Lys Ala Phe Phe Val Trp Tyr Arg Leu
290 295 300

Ile Val Pro Leu Gln Tyr Leu Pro Leu Gly Lys Val Leu Leu Leu Phe
305 310 315 320

Thr Val Ala Asp Met Val Ser Ser Tyr Trp Leu Ala Leu Thr Phe Gln
325 330 335

Ala Asn His Val Val Glu Glu Val Gln Trp Pro Leu Pro Asp Glu Asn
340 345 350

Gly Ile Ile Gln Lys Asp Trp Ala Ala Met Gln Val Glu Thr Thr Gln
355 360 365

Asp Tyr Ala His Asp Ser His Leu Trp Thr Ser Ile Thr Gly Ser Leu
370 375 380

Asn Tyr Gln Ala Val His His Leu Phe Pro Asn Val Ser Gln His His
385 390 395 400

Tyr Pro Asp Ile Leu Ala Ile Ile Lys Asn Thr Cys Ser Glu Tyr Lys
405 410 415

Val Pro Tyr Leu Val Lys Asp Thr Phe Trp Gln Ala Phe Ala Ser His
420 425 430

Leu Glu His Leu Arg Val Leu Gly Leu Arg Pro Lys Glu Glu
435 440 445

<210> 47 

<211> 1350 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 47 

ctctctctct ctctcttctc tctttctctc cccctctctc cggcgatggt tgttgctatg 60

gaccaacgca ccaatgtgaa cggagatccc ggcgccggag accggaagaa agaagaaagg 120

tttgatccga gtgcacaacc accgttcaag atcggagata taagggcggc gattcctaag 180

cactgttggg ttaagagtcc tttgagatca atgagttacg tcgtcagaga cattatcgcc 240

gtcgcggctt tggccatcgc tgccgtgtat gttgatagct ggttcctttg gcctctttat 300

tgggccgccc aaggaacact tttctgggcc atctttgttc tcggccacga ctgtggacat 360

gggagtttct cagacattcc tctactgaat agtgtggttg gtcacattct tcattctttc 420

atcctcgttc cttaccatgg ttggagaata agccaccgga cacaccacca gaaccatggc 480

catgttgaaa acgacgagtc atgggttccg ttaccagaaa gggtgtacaa gaaattgccc 540

cacagtactc ggatgctcag atacactgtc cctctcccca tgctcgcata tcctctctat 600

ttgtgctaca gaagtcctgg aaaagaagga tcacatttta acccatacag tagtttattt 660

gctccaagcg agagaaagct tattgcaact tcaactactt gttggtccat aatgttcgtc 720

agtcttatcg ctctatcttt cgtcttcggt ccactcgcgg ttcttaaagt ctacggtgta 780

ccgtacatta tctttgtgat gtggttggat gctgtcacgt atttgcatca tcatggtcac 840

gatgagaagt tgccttggta tagaggcaag gaatggagtt atctacgtgg aggattaaca 900

acaattgata gagattacgg aatctttaac aacattcatc acgacattgg aactcacgtg 960

atccatcatc tcttcccaca aatccctcac tatcacttgg tcgacgccac gaaagcagct 1020

aaacatgtgt tgggaagata ctacagagaa ccaaagacgt caggagcaat accgatccac 1080

ttggtggaga gtttggtcgc aagtattaag aaagatcatt acgtcagcga cactggtgat 1140

attgtcttct acgagacaga tccagatctc tacgtttacg cttctgacaa a-tckaaaatc 1200

aattaatctc catttgttta gctctattag gaataaacca gcccactttt aaaattttta 1260

tttcttgttg tttttaagtt aaaagtigtac tcgtgaaact cttttttttt tctttttttt 1320

tattaatgta tttacattac aaggcgtaaa 1350



<210> 48

<211> 386 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 48 

Met Val Val Ala Met Asp Gln Arg Thr Asn Val Asn Gly Asp Pro Gly
1 5 10 15

Ala Gly Asp Arg Lys Lys Glu Glu Arg Phe Asp Pro Ser Ala Gln Pro
20 25 30

Pro Phe Lys Ile Gly Asp Ile Arg Ala Ala Ile Pro Lys His Cys Trp
35 40 45

Val Lys Ser Pro Leu Arg Ser Met Ser Tyr Val Val Arg Asp Ile Ile
50 55 60

Ala Val Ala Ala Leu Ala Ile Ala Ala Val Tyr Val Asp Ser Trp Phe
65 70 75 80

Leu Trp Pro Leu Tyr Trp Ala Ala Gln Gly Thr Leu Phe Trp Ala Ile
85 90 95

Phe Val Leu Gly His Asp Cys Gly His Gly Ser Phe Ser Asp Ile Pro
100 105 110

Leu Leu Asn Ser Val Val Gly His Ile Leu His Ser Phe Ile Leu Val
115 120 125

Pro Tyr His Gly Trp Arg Ile Ser His Arg Thr His His Gln Asn His
130 135 140

Gly His Val Glu Asn Asp Glu Ser Trp Val Pro Leu Pro Glu Arg Val
145 150 155 160

Tyr Lys Lys Leu Pro His Ser Thr Arg Met Leu Arg Tyr Thr Val Pro
165 170 175

Leu Pro Met Leu Ala Tyr Pro Leu Tyr Leu Cys Tyr Arg Ser Pro Gly
180 185 190

Lys Glu Gly Ser His Phe Asn Pro Tyr Ser Ser Leu Phe Ala Pro Ser
195 200 205

Glu Arg Lys Leu Ile Ala Thr Ser Thr Thr Cys Trp Ser Ile Met Phe
210 215 220

Val Ser Leu Ile Ala Leu Ser Phe Val Phe Gly Pro Leu Ala Val Leu
225 230 235 240

Lys Val Tyr Gly Val Pro Tyr Ile Ile Phe Val Met Trp Leu Asp Ala
245 250 255

Val Thr Tyr Leu His His His Gly His Asp Glu Lys Leu Pro Trp Tyr
260 265 270

Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr Ile Asp
275 280 285

Arg Asp Tyr Gly Ile Phe Asn Asn Ile His His Asp Ile Gly Thr His
290 295 300

Val Ile His His Leu Phe Pro Gln Ile Pro His Tyr His Leu Val Asp
305 310 315 320

Ala Thr Lys Ala Ala Lys His Val Leu Gly Arg Tyr Tyr Arg Glu Pro
325 330 335

Lys Thr Ser Gly Ala Ile Pro Ile His Leu Val Glu Ser Leu Val Ala
340 345 350

Ser Ile Lys Lys Asp His Tyr Val Ser Asp Thr Gly Asp Ile Val Phe
355 360 365

Tyr Glu Thr Asp Pro Asp Leu Tyr Val Tyr Ala Ser Asp Lys Ser Lys
370 375 380

Ile Asn
385



<210> 49 

<211> 834 

<212> DNA 

<213> Pavlova sp. 



<400> 49 

atgatgttgg ccgcaggcta tcttctagtg ctctcggccg ctcgccagag cttccagcag 60

gacattgaca accccaacgg ggcctactcg acctcgtgga ctggcctgcc cattgtgatg 120

tctgtggtct atctcagcgg tgtgtttggg ctcacaaagt acttcgagaa ccggaagccc 180

atgacggggc tgaaggacta catgttcact tacaatctct accaggtgat catcaacgtg 240

tggtgcgtgg tggcctttct cctggaggtg cggcgtgcgg gcatgtcact catcggcaat 300

aaggtggacc ttgggcccaa ctccttcagg ctcggcttcg tcacgtgggt gcactacaac 360

aacaagtacg tggagctcct cgacacccta tggatggtgc tgcgcaagaa gacgcagcag 420

gtctccttcc tccacgtcta tcatcacgtg cttctgatgt gggcctggtt cgttgtcgtc 480

aagctcggca atggtggtga cgcatatttt ggcggtctca tgaactcgat catccacgtg 540

atgatgtatt cctactacac catggcgctc ctgggctggt catgcccctg gaagcgctac 600

ctcacgcagg cacagctcgt gcagttttgc atctgcctcg cccactccac atgggcggca 660

gtaacgggtg cctacccgtg gcgaatttgc ttggtggagg tgtgggtgat ggtgtccatg 720

ctggtgctct tcacacgctt ctaccgccag gcctatgcca aggaggcgaa ggccaaggag 780

gcgaaaaagc tcgcacagga ggcatcacag gccaaggcgg tcaaggcgga gtaa 834

<210> 50
<211> 277
<212> PRT
<213> Pavlova sp.

<400> 50

Met Met Leu Ala Ala Gly Tyr Leu Leu Val Leu Ser Ala Ala Arg Gln
1 5 10 15

Ser Phe Gln Gln Asp Ile Asp Asn Pro Asn Gly Ala Tyr Ser Thr Ser
20 25 30

Trp Thr Gly Leu Pro Ile Val Met Ser Val Val Tyr Leu Ser Gly Val
35 40 45

Phe Gly Leu Thr Lys Tyr Phe Glu Asn Arg Lys Pro Met Thr Gly Leu
50 55 60

Lys Asp Tyr Met Phe Thr Tyr Asn Leu Tyr Gln Val Ile Ile Asn Val
65 70 75 80

Trp Cys Val Val Ala Phe Leu Leu Glu Val Arg Arg Ala Gly Met Ser
85 90 95

Leu Ile Gly Asn Lys Val Asp Leu Gly Pro Asn Ser Phe Arg Leu Gly
100 105 110

Phe Val Thr Trp Val His Tyr Asn Asn Lys Tyr Val Glu Leu Leu Asp
115 120 125

Thr Leu Trp Met Val Leu Arg Lys Lys Thr Gln Gln Val Ser Phe Leu
130 135 140

His Val Tyr His His Val Leu Leu Met Trp Ala Trp Phe Val Val Val
145 150 155 160

Lys Leu Gly Asn Gly Gly Asp Ala Tyr Phe Gly Gly Leu Met Asn Ser
165 170 175

Ile Ile His Val Met Met Tyr Ser Tyr Tyr Thr Met Ala Leu Leu Gly
180 185 190

Trp Ser Cys Pro Trp Lys Arg Tyr Leu Thr Gln Ala Gln Leu Val Gln
195 200 205

Phe Cys Ile Cys Leu Ala His Ser Thr Trp Ala Ala Val Thr Gly Ala
210 215 220

Tyr Pro Trp Arg Ile Cys Leu Val Glu Val Trp Val Met Val Ser Met
225 230 235 240

Leu Val Leu Phe Thr Arg Phe Tyr Arg Gln Ala Tyr Ala Lys Glu Ala
245 250 255

Lys Ala Lys Glu Ala Lys Lys Leu Ala Gln Glu Ala Ser Gln Ala Lys
260 265 270

Ala Val Lys Ala Glu
275

<210> 51 
<211> 1530 
<212> DNA 

<213> Schizochytrium aggregatum 



<400> 51 
atgacggtgg gcggcgatga ggtgtacagc atggcgcagg tgcgcgacca caacaccccg 60

gacgacgcct ggtgcgccat ccacggcgag gtgtacgagc tgaccaagtt cgcccgcacc 120

caccccgggg gggacatcat cttgctggcc gccggcaagg aggccaccat cctgttcgag 180

acgtaccacg tgcgccccat ctccgacgcg gtcctgcgca agtaccgcat cggcaagctc 240

gccgccgccg gcaaggatga gccggccaac gacagcacct actacagctg ggacagcgac 300

ttttacaagg tgctccgcca gcgtgtcgtg gcgcgcctcg aggagcgcaa gatcgcccgc 360

cgcggcggcc ccgagatctg gatcaaggcc gccatcctcg tcagcggctt ctggtccatg 420

ctctacctca tgtgcaccct ggacccgaac cgcggcgcca tcctggccgc catcgcgctg 480

ggcatcgtcg ccgccttcgt cggcacgtgc attcagcacg acggcaacca cggcgcgttc 540

gccttctctc cgttcatgaa caagctctct ggctggacgc tcgacatgat cggcgccagt 600

gccatgacct gggagatgca gcacgtgctg ggccaccacc cgtacaccaa cctgatcgag 660

atggagaacg gcacccaaaa ggtcacccac gccgacgtcg accccaagaa ggccgaccag 720

gagagcgacc cggacgtctt cagcacctac cccatgctcc gtctgcaccc gtggcaccgc 780

aagcgcttct accaccgctt ccagcacctg tacgcgccgc tgctcttcgg tttcatgacc 840

atcaacaagg tgatcaccca ggatgtggga gttgtcctca gcaagcgtct gtttcagatc 900

gatgccaact gccgttacgc cagcaagtcg tacgttgcgc gcttctggat catgaagctg 960

ctcaccgtcc tctacatggt cgccctcccc gtgtacaccc agggccttgt cgacgggctc 1020

aagctcttct tcatcgccca cttttcgtgc ggcgagctgc tggccaccat gttcatcgtc 1080

aaccacatca tcgagggcgt ctcgtacgcc tccaaggact ctgtcaaggg caccatggcg 1140

ccgccgcgca cggtgcacgg cgtgaccccg atgcatgaca cccgcgacgc gctcggcaag 1200

gagaaggcag ccaccaagca cgtgccgctc aacgactggg ccgcggtcca gtgccagacc 1260

tcggtcaact ggtcgatcgg ctcgtggttc tggaaccact tctccggcgg gctcaaccac 1320

cagatcgagc accacctctt ccccggcctc acccacacca cctacgtgta cattcaggat 1380

gtggtgcagg cgacgtgcgc cgagtacggg gtcccgtacc agtcggagca gagcctcttc 1440

tccgcctact tcaagatgct ctcccacctt cgggcgctcg gcaacgagcc gatgccctcg 1500

tgggagaagg accaccccaa gtccaagtga 1530

<210> 52
<211> 509
<212> PRT

<213> Schizochytrium aggregatum 

<400> 52 

Met Thr Val Gly Gly Asp Glu Val Tyr Ser Met Ala Gln Val Arg Asp
1 5 10 15

His Asn Thr Pro Asp Asp Ala Trp Cys Ala Ile His Gly Glu Val Tyr
20 25 30

Glu Leu Thr Lys Phe Ala Arg Thr His Pro Gly Gly Asp Ile Ile Leu
35 40 45

Leu Ala Ala Gly Lys Glu Ala Thr Ile Leu Phe Glu Thr Tyr His Val
50 55 60

Arg Pro Ile Ser Asp Ala Val Leu Arg Lys Tyr Arg Ile Gly Lys Leu
65 70 75 80

Ala Ala Ala Gly Lys Asp Glu Pro Ala Asn Asp Ser Thr Tyr Tyr Ser
85 90 95

Trp Asp Ser Asp Phe Tyr Lys Val Leu Arg Gln Arg Val Val Ala Arg
100 105 110

Leu Glu Glu Arg Lys Ile Ala Arg Arg Gly Gly Pro Glu Ile Trp Ile
115 120 125

Lys Ala Ala Ile Leu Val Ser Gly Phe Trp Ser Met Leu Tyr Leu Met
130 135 140

Cys Thr Leu Asp Pro Asn Arg Gly Ala Ile Leu Ala Ala Ile Ala Leu
145 150 155 160

Gly Ile Val Ala Ala Phe Val Gly Thr Cys Ile Gln His Asp Gly Asn
165 170 175

His Gly Ala Phe Ala Phe Ser Pro Phe Met Asn Lys Leu Ser Gly Trp
180 185 190

Thr Leu Asp Met Ile Gly Ala Ser Ala Met Thr Trp Glu Met Gln His
195 200 205

Val Leu Gly His His Pro Tyr Thr Asn Leu Ile Glu Met Glu Asn Gly
210 215 220

Thr Gln Lys Val Thr His Ala Asp Val Asp Pro Lys Lys Ala Asp Gln
225 230 235 240

Glu Ser Asp Pro Asp Val Phe Ser Thr Tyr Pro Met Leu Arg Leu His
245 250 255

Pro Trp His Arg Lys Arg Phe Tyr His Arg Phe Gln His Leu Tyr Ala
260 265 270

Pro Leu Leu Phe Gly Phe Met Thr Ile Asn Lys Val Ile Thr Gln Asp
275 280 285

Val Gly Val Val Leu Ser Lys Arg Leu Phe Gln Ile Asp Ala Asn Cys
290 295 300

Arg Tyr Ala Ser Lys Ser Tyr Val Ala Arg Phe Trp Ile Met Lys Leu
305 310 315 320

Leu Thr Val Leu Tyr Met Val Ala Leu Pro Val Tyr Thr Gln Gly Leu
325 330 335

Val Asp Gly Leu Lys Leu Phe Phe Ile Ala His Phe Ser Cys Gly Glu
340 345 350

Leu Leu Ala Thr Met Phe Ile Val Asn His Ile Ile Glu Gly Val Ser
355 360 365

Tyr Ala Ser Lys Asp Ser Val Lys Gly Thr Met Ala Pro Pro Arg Thr
370 375 380

Val His Gly Val Thr Pro Met His Asp Thr Arg Asp Ala Leu Gly Lys
385 390 395 400

Glu Lys Ala Ala Thr Lys His Val Pro Leu Asn Asp Trp Ala Ala Val
405 410 415

Gln Cys Gln Thr Ser Val Asn Trp Ser Ile Gly Ser Trp Phe Trp Asn
420 425 430

His Phe Ser Gly Gly Leu Asn His Gln Ile Glu His His Leu Phe Pro
435 440 445

Gly Leu Thr His Thr Thr Tyr Val Tyr Ile Gln Asp Val Val Gln Ala
450 455 460

Thr Cys Ala Glu Tyr Gly Val Pro Tyr Gln Ser Glu Gln Ser Leu Phe
465 470 475 480

Ser Ala Tyr Phe Lys Met Leu Ser His Leu Arg Ala Leu Gly Asn Glu
485 490 495

Pro Met Pro Ser Trp Glu Lys Asp His Pro Lys Ser Lys
500 505



<210> 53 

<211> 27 

<212> DNA 

<213> Artificial Sequence 

<220>
<223> synthetic oligonucleotide

<400> 53
gcggccgcat gactgaggat aagacga 27

<210> 54
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 54
gcggccgctt agtccgactt ggccttg 27

<210> 55
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 55
gcggccgcat ggagtcgatt gcgc 24

<210> 56
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 56
gcggccgctt actgcaactt cctt 24

<210> 57
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 57
gcggccgcat gggaacggac caag 24

<210> 58
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 58
gcggccgcct actcttcctt ggga 24

<210> 59
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 59
ttcctgcagg ctagcctaag tacgtactc 29

<210> 60 
<211> 21 
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 60
aagcggccgc ggtgatgact g 21

<210> 61 

<211> 12 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> consensus peptide 

<400> 61
Ile Arg Ala Ala Ile Pro Lys His Cys Trp Val Lys
1 5 10

<210> 62 

<211> 36 

<212> DNA 

<213> Artificial Sequence 

<220>
<223> synthetic oligonucleotide

<400> 62
atccgcgccg ccatccccaa gcactgctgg gtcaag 36

<210> 63

<211> 15 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> consensus peptide 

<400> 63
Ala Leu Phe Val Leu Gly His Asp Cys Gly His Gly Ser Phe Ser
1 5 10 15



<210> 64 

<211> 45 

<212> DNA 

<213> Artificial Sequence 

<220>
<223> synthetic oligonucleotide

<220>
<221> unsure
<222> (21)..(21)
<223> y = c or t

<220>
<221> unsure
<222> (33)..(33)
<223> y = c or t

<400> 64
gccctcttcg tcctcggcca ygactgcggc cayggctcgt tctcg 45

<210> 65 

<211> 45 

<212> DNA 

<213> Artificial Sequence 



<220>
<223> synthetic oligonucleotide

<220>
<221> unsure
<222> (4)..(4)
<223> r = a or g

<220>
<221> unsure
<222> (10)..(10)
<223> r = a or g

<220>
<221> unsure
<222> (30)..(30)
<223> r = a or g

<220>
<221> unsure
<222> (31)..(31)
<223> r = a or g

<220>
<221> unsure
<222> (34)..(34)
<223> r = a or g

<220>
<221> unsure
<222> (38)..(38)
<223> r = a or g

<220>
<221> unsure
<222> (39)..(39)
<223> y = c or t

<220>
<221> unsure
<222> (43)..(43)
<223> r = a or g

<400> 65
gagrtggtar tgggggatct gggggaagar rtgrtggryg acrtg 45

<210> 66 
<211> 15 
<212> PRT
<213> Artificial Sequence

<220>
<223> consensus peptide

<400> 66
Pro Tyr His Gly Trp Arg Ile Ser His Arg Thr His His Gln Asn
1 5 10 15



<210> 67
<211> 45
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<220>
<221> unsure
<222> (9)..(9)
<223> y = c or t

<220>
<221> unsure
<222> (27)..(27)
<223> y = c or t

<220>
<221> unsure
<222> (36)..(36)
<223> y = c or t

<220>
<221> unsure
<222> (39)..(39)
<223> y = c or t

<400> 67
ccctaccayg gctggcgcat ctcgcaycgc acccaycayc agaac 45



<210> 68
<211> 45
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<220>
<221> unsure
<222> (7)..(7)
<223> r = a or g

<220>
<221> unsure
<222> (10)..(10)
<223> r = a or g

<220>
<221> unsure
<222> (19)..(19)
<223> r = a or g

<220>
<221> unsure
<222> (37)..(37)
<223> r = a or g

<400> 68
gttctgrtgr tgggtccgrt gcgagatgcg ccagccrtgg taggg 45

<210> 69 

<211> 12 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> consensus peptide 
<220>
<221> UNSURE
<222> (5)..(5)
<223> Xaa = Asp or His

<220>
<221> UNSURE
<222> (7)..(7)
<223> Xaa = Asp or Tyr

<400> 69 

Gly Ser His Phe Xaa Pro Xaa Ser Asp Leu Phe Val 
1 5 10 

<210> 70 

<211> 36 

<212> DNA 

<213> Artificial Sequence 



<220>
<223> synthetic oligonucleotide

<220>
<221> unsure
<222> (13)..(13)
<223> s = c or g

<220>
<221> unsure
<222> (19)..(19)
<223> k = g or t

<400> 70
ggctcgcact tcsaccccka ctcggacctc ttcgtc 36

<210> 71 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220>
<223> synthetic oligonucleotide

<220>
<221> unsure
<222> (18)..(18)
<223> m = a or c

<220>
<221> unsure
<222> (24)..(24)
<223> w = a or t

<400> 71
gacgaagagg tccgagtmgg ggtwgaagtg cgagcc 36

<210> 72
<211> 13
<212> PRT
<213> Artificial Sequence

<220>
<223> consensus peptide

<220>
<221> UNSURE
<222> (3)..(3)
<223> Xaa = Tyr or Phe

<220>
<221> UNSURE
<222> (4)..(4)
<223> Xaa = Leu or Val

<220>
<221> UNSURE
<222> (11)..(11)
<223> Xaa = Leu or Ile

<400> 72
Trp Ser Xaa Xaa Arg Gly Gly Leu Thr Thr Xaa Asp Arg
1 5 10



<210> 73
<211> 39
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<220>
<221> unsure
<222> (9)..(9)
<223> k = g or t

<220>
<221> unsure
<222> (30)..(30)
<223> w = a or t

<220>
<221> unsure
<222> (32)..(32)
<223> s = c or g

<400> 73
gcgctggakg gtggtgaggc cgccgcggaw gsacgacca 39



<210> 74
<211> 15
<212> PRT
<213> Artificial Sequence

<220>
<223> consensus peptide

<400> 74
His His Asp Ile Gly Thr His Val Ile His His Leu Phe Pro Gln
1 5 10 15

<210> 75
<211> 45
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<220>
<221> unsure
<222> (13)..(13)
<223> r = a or g

<220>
<221> unsure
<222> (16)..(16)
<223> r = a or g

<220>
<221> unsure
<222> (25)..(25)
<223> r = a or g

<220>
<221> unsure
<222> (40)..(40)
<223> r = a or g

<220>
<221> unsure
<222> (43)..(43)
<223> r = a or g

<400> 75
ctgggggaag agrtgrtgga tgacrtgggt gccgatgtcr tgrtg 45
<210> 76
<211> 15
<212> PRT
<213> Artificial Sequence

<220>
<223> consensus peptide

<220>
<221> UNSURE
<222> (2)..(2)
<223> Xaa = Leu or Phe

<220>
<221> UNSURE
<222> (5)..(5)
<223> Xaa = Gln or Lys

<220>
<221> UNSURE
<222> (12)..(12)
<223> Xaa = Val or Ile

<400> 76
His Xaa Phe Pro Xaa Ile Pro His Tyr His Leu Xaa Glu Ala Thr
1 5 10 15

<210> 77 

<211> 45 

<212> DNA 

<213> Artificial Sequence 

<220>
<223> synthetic oligonucleotide

<220>
<221> unsure
<222> (12)..(12)
<223> y = c or t

<220>
<221> unsure
<222> (16)..(16)
<223> r = a or g

<220>
<221> unsure
<222> (22)..(22)
<223> r = a or g

<220>
<221> unsure
<222> (33)..(33)
<223> k = g or t

<220>
<221> unsure
<222> (42)..(42)
<223> r = a or g

<220>
<221> unsure
<222> (43)..(43)
<223> r = a or g

<400> 77
ggtggcctcg aygagrtggt artgggggat ctkggggaag arrtg 45

<210> 78 

<211> 15 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> consensus peptide 

<220>
<221> UNSURE
<222> (3)..(3)
<223> Xaa = Ala or Ile

<220>
<221> UNSURE
<222> (6)..(6)
<223> Xaa = Leu or Phe

<400> 78
His Val Xaa His His Xaa Phe Pro Gln Ile Pro His Tyr His Leu
1 5 10 15



<210> 


79 


<211> 


25 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


synthetic oligonucleotide 


<400> 


79 


tacgcgtacc tcacgtactc gctcg 


<210> 


80 


<211> 


27 


<212> 


DNA 


<213> 


Artificial Secjuence 


<220> 




<223> 


synthetic oligonucleotide 


<400> 


80 


ttcttgcacc acaacgacga agcgacg 


<210> 


81 


<211> 


25 


<212> 


DNA 


<213> 


Artificial Sequence 



25 



27 



<220> 

<223> synthetic oligonucleotide 
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<400> 81 

ggagtggacg tacgtcaagg gcaac 

<210> 82 

<211> 26 

<212> DNA 

<213> Artificial Sequence 



25 



<220> 
<223> 



synthetic oligonucleotide 



<400> 82 

tcaagggcaa cctctcgagc gtcgac 



26 



<210> 83
<211> 31
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 83
cccagtcacg acgttgtaaa acgacggcca g 31



<210> 84
<211> 30
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 84
agcggataac aatttcacac aggaaacagc 30

<210> 85
<211> 30
<212> DNA
<213> Artificial Sequence



<220> 

<223> synthetic oligonucleotide 

<400> 85 

ggtaaaagat ctcgtccttg tcgatgttgc 30

<210> 86
<211> 20
<212> DNA
<213> Artificial Sequence



<220>
<223> synthetic oligonucleotide

<400> 86
gtcaaagtgg ctcatcgtgc 20

<210> 87
<211> 26
<212> DNA
<213> Artificial Sequence

<220>

<223> synthetic oligonucleotide

<400> 87
cgagcgagta cgtgaggtac gcgtac 26

<210> 88
<211> 45
<212> DNA
<213> Artificial Sequence



<220>
<223> synthetic oligonucleotide

<400> 88
tcaacagaat tcatgaccga ggataagacg aaggtcgagt tcccg 45

<210> 89
<211> 45
<212> DNA
<213> Artificial Sequence



<220>
<223> synthetic oligonucleotide

<400> 89
aaaagaaagc ttcgcttcct agtcttagtc cgacttggcc ttggc 45

<210> 90
<211> 3979
<212> DNA
<213> Glycine max



<400> 90 

ggccgcagat ttaggtgaca ctatagaata tgcatcacta gtaagctttg ctctagatca 60
aactcacatc caaacataac atggatatct tccttaccaa tcatactaat tattttgggt 120
taaatattaa tcattatttt taagatatta attaagaaat taaaagattt tttaaaaaaa 180
tgtataaaat tatattattc atgatttttc atacatttga ttttgataat aaatatattt 240
tttttaattt cttaaaaaat gttgcaagac acttattaga catagtcttg ttctgtttac 300
aaaagcattc atcatttaat acattaaaaa atatttaata ctaacagtag aatcttcttg 360
tgagtggtgt gggagtaggc aacctggcat tgaaacgaga gaaagagagt cagaaccaga 420
agacaaataa aaagtatgca acaaacaaat caaaatcaaa gggcaaaggc tggggttggc 480
tcaattggtt gctacattca attttcaact cagtcaacgg ttgagattca ctctgacttc 540
cccaatctaa gccgcggatg caaacggttg aatctaaccc acaatccaat ctcgttactt 600
aggggctttt ccgtcattaa ctcacccctg ccacccggtt tccctataaa ttggaactca 660
atgctcccct ctaaactcgt atcgcttcag agttgagacc aagacacact cgttcatata 720
tctctctgct cttctcttct cttctacctc tcaaggtact tttcttctcc ctctaccaaa 780
tcctagattc cgtggttcaa tttcggatct tgcacttctg gtttgctttg ccttgctttt 840
tcctcaactg ggtccatcta ggatccatgt gaaactctac tctttcttta atatctgcgg 900
aatacgcgtt ggactttcag atctagtcga aatcatttca taattgcctt tctttctttt 960
agcttatgag aaataaaatc actttttttt tatttcaaaa taaaccttgg gccttgtgct 1020
gactgagatg gggtttggtg attacagaat tttagcgaat tttgtaattg tacttgtttg 1080
tctgtagttt tgttttgttt tcttgtttct catacattcc ttaggcttca attttattcg 1140
agtataggtc acaataggaa ttcaaacttt gagcagggga attaatccct tccttcaaat 1200
ccagtttgtt tgtatatatg tttaaaaaat gaaacttttg ctttaaattc tattataact 1260
ttttttatgg ctgaaatttt tgcatgtgtc tttgctctct gttgtaaatt tactgtttag 1320
gtactaactc taggcttgtt gtgcagtttt tgaagtataa ccatgccaca caacacaatg 1380
gcggccaccg cttccagaac cacccgattc tcttcttcct cttcacaccc caccttcccc 1440
aaacgcatta ctagatccac cctccctctc tctcatcaaa ccctcaccaa acccaaccac 1500
gctctcaaaa tcaaatgttc catctccaaa ccccccacgg cggcgccctt caccaaggaa 1560
gcgccgacca cggagccctt cgtgtcacgg ttcgcctccg gcgaacctcg caagggcgcg 1620
gacatccttg tggaggcgct ggagaggcag ggcgtgacga cggtgttcgc gtaccccggc 1680
ggtgcgtcga tggagatcca ccaggcgctc acgcgctccg ccgccatccg caacgtgctc 1740
ccgcgccacg agcagggcgg cgtcttcgcc gccgaaggct acgcgcgttc ctccggcctc 1800
cccggcgtct gcattgccac ctccggcccc ggcgccacca acctcgtgag cggcctcgcc 1860
gacgctttaa tggacagcgt cccagtcgtc gccatcaccg gccaggtcgc ccgccggatg 1920
atcggcaccg acgccttcca agaaaccccg atcgtggagg tgagcagatc catcacgaag 1980
cacaactacc tcatcctcga cgtcgacgac atcccccgcg tcgtcgccga ggctttcttc 2040
gtcgccacct ccggccgccc cggtccggtc ctcatcgaca ttcccaaaga cgttcagcag 2100
caactcgccg tgcctaattg ggacgagccc gttaacctcc ccggttaqct cgccaggctg 2160
cccaggcccc ccgccgaggc ccaattggaa cacattgtca gactcatcat ggaggcccaa 2220
aagcccgttc tctacgtcgg cggtggcagt ttgaattcca gtgctgaatt gaggcgcttt 2280
gttgaactca ctggtattcc cgttgctagc actttaatgg gtcttggaac ttttcctatt 2340
ggtgatgaat attcccttca gatgctgggt atgcatggta ctgtttatgc taactatgct 2400
gttgacaata gtgatttgtt gcttgccttt ggggtaaggt ttgatgaccg tgttactggg 2460
aagcttgagg cttttgctag tagggctaag attgttcaca ttgatattga ttctgccgag 2520
attgggaaga acaagcaggc gcacgtgtcg gtttgcgcgg atttgaagtt ggccttgaag 2580
ggaattaata tgattttgga ggagaaagga gtggagggta agtttgatct tggaggttgg 2640
agagaagaga ttaatgtgca gaaacacaag tttccattgg gttacaagac attccaggac 2700
gcgatttctc cgcagcatgc tatcgaggtt cttgatgagt tgactaatgg agatgctatt 2760
gttagtactg gggttgggca gcatcaaatg tgggctgcgc agttttacaa gtacaagaga 2820
ccgaggcagt ggttgacctc agggggtctt ggagccatgg gttttggatt gcctgcggct 2880
attggtgctg ctgttgctaa ccctggggct gttgtggttg acattgatgg ggatggtagt 2940
ttcatcatga atgttcagga gttggccact ataagagtgg agaatctccc agttaagata 3000
ttgttgttga acaatcagca tttgggtatg gtggttcagt tggaggatag gttctacaag 3060
tccaatagag ctcacaccta tcttggagat ccgtctagcg agagcgagat attcccaaac 3120
atgctcaagt ttgctgatgc ttgtgggata ccggcagcgc gagtgacgaa gaaggaagag 3180
cttagagcgg caattcagag aatgttggac acccctggcc cctaccttct tgatgtcatt 3240
gtgccccatc aggagcatgt gttgccgatg attcccagta atggatcctt caaggatgtg 3300
ataactgagg gtgatggtag aacgaggtac tgattgccta gaccaaatgt tccttgatgc 3360
ttgttttgta caatatatat aagataatgc tgtcctagtt gcaggatttg gcctgtggtg 3420
agcatcatag tctgtagtag ttttggtagc aagacatttt attttccttt tatttaactt 3480
actacatgca gtagcatcta tctatctctg tagtctgata tctcctgttg tctgtattgt 3540
gccgttggat tttttgctgt agtgagactg aaaatgatgt gctagtaata atatttctgt 3600
tagaaatcta agtagagaat ctgttgaaga agtcaaaagc taatggaatc aggttacata 3660
tcaatgtttt tcttttttta gcggttggta gacgtgtaga ttcaacttct cttggagctc 3720
acctaggcaa tcagtaaaat gcatattcct tttttaactt gccatttatt tacttttagt 3780
ggaaattgtg accaatttgt tcatgtagaa cggatttgga ccattgcgtc cacaaaacgt 3840
ctcttttgct cgatcttcac aaagcgatac cgaaatccag agatagtttt caaaagtcag 3900
aaatggcaaa gttataaata gtaaaacaga atagatgctg taatcgactt caataacaag 3960
tggcatcacg tttctagtt 3979



<210> 91
<211> 17
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 91
tgcggccgca tgagccg 17

<210> 92
<211> 32
<212> DNA
<213> Artificial Sequence



<220>
<223> synthetic oligonucleotide

<400> 92
acgtacggta ccatctgcta atattttaaa tc 32



<210> 93
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 93
taatacgact cactattagg 20

<210> 94
<211> 35
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 94
tgcccatgat gttggccgca ggctatcttc tagtg 35

<210> 95
<211> 25
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 95
gctgtcaacg atacgctacg taacg 25

<210> 96
<211> 25
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 96
gccaattgga gcgagttcca atctc 25

<210> 97
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 97
gcgatatccg tttcttctga ccttcatc 28



<210> 98
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> synthetic oligonucleotide

<400> 98
ttctagacct gcaggatata atgagccg 28


